NOVEL NUCLEIC ACIDS AND POLYPEPTIDES


1. TECHNICAL FIELD

The present invention provides novel polynucleotides and proteins encoded by such
polynucleotides, along with uses for these polynucleotides and proteins, for example in
therapeutic, diagnostic and research methods.

2. BACKGROUND 

Technology aimed at the discovery of protein factors (including, e.g., cytokines, such as
lymphokines, interferons, circulating soluble factors, chemokines, and interleukins) has matured
rapidly over the past decade. The now routine hybridization cloning and expression cloning
techniques clone novel polynucleotides "directly" in the sense that they rely on information
directly related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein in
the case of hybridization cloning; activity of the protein in the case of expression cloning). More
recent "indirect" cloning techniques such as signal sequence cloning, which isolates DNA
sequences based on the presence of a now well-recognized secretory leader sequence motif, as
well as various PCR-based or low stringency hybridization-based cloning techniques, have
advanced the state of the art by making available large numbers of DNA/amino acid sequences
for proteins that are known to have biological activity, for example, by virtue of their secreted
nature in the case of leader sequence cloning, by virtue of their cell or tissue source in the case of
PCR-based techniques, or by virtue of structural similarity to other genes of known biological
activity.

Identified polynucleotide and polypeptide sequences have numerous applications in, for
example, diagnostics, forensics, gene mapping, identification of mutations responsible for
genetic disorders or other traits, assessment of biodiversity, and production of many other types
of data and products dependent on DNA and amino acid sequences.

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules,
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more
epitopes present on such polypeptides, as well as hybridomas producing such antibodies.

The compositions of the present invention additionally include vectors, including expression
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such
polynucleotides and cells genetically engineered to express such polynucleotides.

The present invention relates to a collection or library of at least one novel nucleic acid
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by
hybridization (SBH), and in some cases, sequences obtained from one or more public databases.
The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic,
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid
sequences are designated as SEQ ID NO: 1-5497. The polypeptide sequences are designated SEQ
ID NO: 5498-10994. The nucleic acids and polypeptides are provided in the Sequence Listing. In
the nucleic acids provided in the Sequence Listing, A is adenine; C is cytosine; G is guanine; T is
thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, *
corresponds to the stop codon.

The nucleic acid sequences of the present invention also include nucleic acid sequences that
hybridize to the complement of SEQ ID NO: 1-5497 under stringent hybridization conditions;
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid
sequences recited above; or nucleic acid sequences that encode a peptide comprising a specific
domain or truncation of the peptides encoded by SEQ ID NO: 1-5497. Also included is a polynucleotide
comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ
ID NO: 1-5497 or a degenerate variant or fragment thereof. The identifying sequence can be 100
base pairs in length.

The nucleic acid sequences of the present invention also include the sequence information
from the nucleic acid sequences of SEQ ID NO: 1-5497. The sequence information can be a
segment of any one of SEQ ID NO: 1-5497 that uniquely identifies or represents the sequence
information of SEQ ID NO: 1-5497.

A collection as used in this application can be a collection of only one polynucleotide. The
collection of sequence information or identifying information of each sequence can be provided on
a nucleic acid array. In one embodiment, segments of sequence information are provided on a
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection
can also be provided in a computer-readable format.

This invention also includes the reverse or direct complement of any of the nucleic acid
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their
reverse or direct complements) according to the invention have numerous applications in a variety
of techniques known to those skilled in the art of molecular biology, such as use as hybridization
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing
full-length genes, use for chromosome and gene mapping, use in the recombinant production of
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like.

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-5497 or novel
segments or parts of the nucleic acids of the invention are used as primers in expression assays that
are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of
SEQ ID NO: 1-5497 or novel segments or parts of the nucleic acids provided herein are used in
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath
et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human
genome.

The isolated polynucleotides of the invention include, but are not limited to, a
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-5497; a
polynucleotide comprising any of the full length protein coding sequences of SEQ ID NO: 1-5497;
and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding
sequences of SEQ ID NO: 1-5497. The polynucleotides of the present invention also include, but
are not limited to, a polynucleotide that hybridizes under stringent hybridization conditions to (a)
the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1-5497; (b) a
nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing
(e.g., SEQ ID NO: 5498-10994); (c) a polynucleotide which is an allelic variant of any of the
polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g.,
orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid
sequence set forth in the Sequence Listing.

The isolated polypeptides of the invention include, but are not limited to, a polypeptide
comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding
full length or mature protein. Polypeptides of the invention also include polypeptides with
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence
set forth in SEQ ID NO: 1-5497; or (b) polynucleotides that hybridize to the complement of the
polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial
equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%
amino acid sequence identity) that preferably retain biological activity are also contemplated. The
polypeptides of the invention may be wholly or partially chemically synthesized but are preferably
produced by recombinant means using the genetically engineered cells (e.g., host cells) of the
invention.

The invention also provides compositions comprising a polypeptide of the invention.
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a
hydrophilic, e.g., pharmaceutically acceptable, carrier.

The invention also provides host cells transformed or transfected with a polynucleotide of 
the invention. 

The invention also relates to methods for producing a polypeptide of the invention
comprising growing a culture of the host cells of the invention in a suitable culture medium
under conditions permitting expression of the desired polypeptide, and purifying the polypeptide
from the culture or from the host cells. Preferred embodiments include those in which the
protein produced by such process is a mature form of the protein.

Polynucleotides according to the invention have numerous applications in a variety of
techniques known to those skilled in the art of molecular biology. These techniques include use
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is
largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample
using, e.g., in situ hybridization.

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional
procedures and methods that are currently applied to other proteins. For example, a polypeptide
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight
markers, and as a food supplement.

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
pharmaceutically acceptable carrier.

In particular, the polypeptides and polynucleotides of the invention can be utilized, for
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 

The present invention further relates to methods for detecting the presence of the
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the
identification of subjects exhibiting a predisposition to such conditions. The invention provides
a method for detecting the polynucleotides of the invention in a sample, comprising contacting
the sample with a compound that binds to and forms a complex with the polynucleotide of
interest for a period sufficient to form the complex and under conditions sufficient to form a
complex and detecting the complex such that if a complex is detected, the polynucleotide of
interest is detected. The invention also provides a method for detecting the polypeptides of the
invention in a sample comprising contacting the sample with a compound that binds to and forms
a complex with the polypeptide under conditions and for a period sufficient to form the complex
and detecting the formation of the complex such that if a complex is formed, the polypeptide is
detected.

The invention also provides kits comprising polynucleotide probes and/or monoclonal
antibodies, and optionally quantitative standards, for carrying out methods of the invention.
Furthermore, the invention provides methods for evaluating the efficacy of drugs, and
monitoring the progress of patients, involved in clinical trials for the treatment of disorders as
recited above.

The invention also provides methods for the identification of compounds that modulate
(i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides
of the invention. Such methods can be utilized, for example, for the identification of compounds
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are
not limited to, assays for identifying compounds and other substances that interact with (e.g.,
bind to) the polypeptides of the invention. The invention provides a method for identifying a
compound that binds to the polypeptides of the invention comprising contacting the compound
with a polypeptide of the invention in a cell for a time sufficient to form a
polypeptide/compound complex, wherein the complex drives expression of a reporter gene
sequence in the cell; and detecting the complex by detecting the reporter gene sequence
expression such that if expression of the reporter gene is detected the compound that binds to a
polypeptide of the invention is identified.

The invention also provides methods for treatment which involve the
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or
disorders as recited herein comprising administering compounds and other substances that
modulate the overall activity of the target gene products. Compounds and other substances can
effect such modulation either on the level of target gene/protein expression or target protein
activity.

The polypeptides of the present invention and the polynucleotides encoding them are also 
useful for the same functions known to one of skill in the art as the polypeptides and 
polynucleotides to which they have homology (set forth in the sequence listing). If no homology 
is set forth for a sequence, then the polypeptides and polynucleotides of the present invention are 
useful for a variety of applications, as described herein, including use in arrays for detection. 


4. DETAILED DESCRIPTION OF THE INVENTION 
4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a",
"an" and "the" include plural references unless the context clearly dictates otherwise.

The term "active" refers to those forms of the polypeptide which retain the biologic 
and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 
natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application refers to those cells which are engaged in
extracellular or intracellular membrane trafficking, including the export of secretory or
enzymatic molecules as part of a normal or disease process.

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules
may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 
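
The base-pairing rule in this definition can be illustrated with a minimal sketch. The function names below are hypothetical and are not part of the specification; treating N as pairing only with N is a simplification made for the example.

```python
# Watson-Crick pairing for DNA bases. Pairing N only with N is a
# simplifying assumption for this illustration.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G", "N": "N"}


def complement(seq: str) -> str:
    """Base-by-base complement, written in the same left-to-right order.

    For a strand given 5'->3', the result reads 3'->5'.
    """
    return "".join(PAIRS[base] for base in seq.upper())


def reverse_complement(seq: str) -> str:
    """Complementary strand rewritten in the conventional 5'->3' direction."""
    return complement(seq)[::-1]


def fraction_complementary(strand_5to3: str, strand_3to5: str) -> float:
    """Fraction of aligned positions that pair: 1.0 is "complete", less is "partial"."""
    if len(strand_5to3) != len(strand_3to5):
        raise ValueError("strands must have equal length")
    paired = sum(
        PAIRS[x] == y for x, y in zip(strand_5to3.upper(), strand_3to5.upper())
    )
    return paired / len(strand_5to3)


print(complement("AGT"))                     # 5'-AGT-3' pairs with 3'-TCA-5'
print(fraction_complementary("AGT", "TCA"))  # complete complementarity
print(fraction_complementary("AGT", "TCG"))  # partial complementarity
```

The fraction returned for mismatched strands is only a count of pairing positions; it is not a model of hybridization strength.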


WO 01/79449 



PCT/US01/08656 


The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
are derived. The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 
(U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 
acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides,
more preferably at least about 9 nucleotides, more preferably at least about 11 nucleotides and
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides,
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can
be used in polymerase chain reaction (PCR), various hybridization procedures or microarray
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A
fragment or segment may uniquely identify each polynucleotide sequence of the present
invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ
ID NO: 1-5497.

Probes may, for example, be used to determine whether specific mRNA molecules are
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as
described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1:241-250). They may
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the
art. Probes of the present invention, their preparation and/or labeling are elaborated in
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their
entirety.

The nucleic acid sequences of the present invention also include the sequence
information from the nucleic acid sequences of SEQ ID NO: 1-5497. The sequence information
can be a segment of any one of SEQ ID NO: 1-5497 that uniquely identifies or represents the
sequence information of that sequence of SEQ ID NO: 1-5497. One such segment can be a
twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in
the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set
of chromosomes. Because 4^20 possible twenty-mers exist, there are 300 times more twenty-mers
than there are base pairs in a set of human chromosomes. Using the same analysis, the
probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in
5. When these segments are used in arrays for expression studies, fifteen-mer segments can be
used. The probability that the fifteen-mer is fully matched in the expressed sequences is also
approximately one in five because expressed sequences comprise less than approximately 5% of
the entire genome sequence.
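
The counting argument above can be reproduced with a few lines of arithmetic. This is a rough sketch under the text's own figures (three billion base pairs, expressed sequences taken as exactly 5% of the genome); the function name is hypothetical, and the exact ratios come out somewhat larger than the rounded "1 in 300" and "1 in 5" stated in the text.

```python
GENOME_BP = 3_000_000_000    # base pairs in one set of human chromosomes (from the text)
EXPRESSED_FRACTION = 0.05    # upper bound from the text, used here as an exact value


def kmers_per_position(k: int, target_bp: float) -> float:
    """Number of possible k-mers (4**k) per base-pair position in the target.

    The chance that one given k-mer is fully matched somewhere in the
    target is roughly 1 in this number.
    """
    return 4 ** k / target_bp


ratio_20 = kmers_per_position(20, GENOME_BP)                       # about 366
ratio_17 = kmers_per_position(17, GENOME_BP)                       # about 5.7
ratio_15 = kmers_per_position(15, GENOME_BP * EXPRESSED_FRACTION)  # about 7.2

for label, ratio in [("20-mer, genome", ratio_20),
                     ("17-mer, genome", ratio_17),
                     ("15-mer, expressed sequences", ratio_15)]:
    print(f"{label}: roughly 1 in {ratio:.1f}")
```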

Similarly, when using sequence information for detecting a single mismatch, a segment can
be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome
with a single mismatch is calculated by multiplying the probability for a full match (1/4^25) times the
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an
eighteen mer with a single mismatch can be detected in an array for expression studies is
approximately one in five. The probability that a twenty-mer with a single mismatch can be
detected in a human genome is approximately one in five.
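
The single-mismatch calculation can be sketched in the same way: the full-match chance is multiplied by 3 x k, the number of ways to change one of k positions to one of the three other bases. The function name and the 5% expressed fraction are assumptions made for illustration; under them the results are of the same order as, though somewhat smaller than, the "one in five" figures given in the text.

```python
GENOME_BP = 3_000_000_000    # base pairs in one set of human chromosomes (from the text)
EXPRESSED_FRACTION = 0.05    # assumed exact for this illustration


def single_mismatch_chance(k: int, target_bp: float) -> float:
    """Approximate chance that a k-mer occurs in the target with one mismatch.

    Full-match chance (target_bp / 4**k) times the 3 * k possible
    single-base substitutions.
    """
    full_match = target_bp / 4 ** k
    return full_match * 3 * k


p25_genome = single_mismatch_chance(25, GENOME_BP)                       # about 0.0002
p20_genome = single_mismatch_chance(20, GENOME_BP)                       # about 0.16
p18_expressed = single_mismatch_chance(18, GENOME_BP * EXPRESSED_FRACTION)  # about 0.12

print(f"25-mer in genome:     {p25_genome:.6f}")
print(f"20-mer in genome:     {p20_genome:.3f}")
print(f"18-mer in expressed:  {p18_expressed:.3f}")
```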

The term "open reading frame," ORF, means a series of nucleotide triplets coding for
amino acids without any termination codons and is a sequence translatable into protein.
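
As an illustration of this definition, the following minimal sketch tests whether a DNA string is a series of complete triplets with no termination codon. It assumes the standard genetic code's three stop codons and a sequence written as the sense strand; the function names are hypothetical.

```python
# Termination codons of the standard genetic code, written as DNA.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq: str) -> list[str]:
    """Split a sequence into consecutive triplets, dropping any trailing partial triplet."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def is_open_reading_frame(seq: str) -> bool:
    """True if seq is a non-empty run of whole A/C/G/T triplets with no stop codon."""
    seq = seq.upper()
    if not seq or len(seq) % 3 != 0:
        return False
    if set(seq) - set("ACGT"):
        return False
    return not any(codon in STOP_CODONS for codon in codons(seq))


print(is_open_reading_frame("ATGGCCAAA"))  # three sense codons
print(is_open_reading_frame("ATGTAAGCC"))  # contains the stop codon TAA in frame
```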

The terms "operably linked" or "operably associated" refer to functionally related nucleic
acid sequences. For example, a promoter is operably associated or operably linked with a coding
sequence if the promoter controls the transcription of the coding sequence. While operably
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic
elements, e.g., repressor genes, are not contiguously linked to the coding sequence but still control
transcription/translation of the coding sequence.

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in 
its differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide,
peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more
preferably at least about 9 amino acids and most preferably at least about 17 or more amino
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient
length to display biological and/or immunological activity.

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that
have not been genetically engineered and specifically contemplates various polypeptides arising
from post-translational modifications of the polypeptide including, but not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.

The term "translated protein coding portion" means a sequence which encodes the full
length protein, which may include any leader sequence or any processing sequence.

The term "mature protein coding sequence" means a sequence which encodes a peptide
or protein without a signal or leader sequence. The "mature protein portion" means that portion
of the protein which does not include a signal or leader sequence. The peptide may have been
produced by processing in the cell which removes any leader/signal sequence. The mature
protein portion may or may not include an initial methionine residue. The methionine residue
may be removed from the protein during processing in the cell. The peptide may be produced
synthetically or the protein may have been produced using a polynucleotide only encoding for
the mature protein coding sequence.

The term "derivative" refers to polypeptides chemically modified by such techniques as
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or
substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur
in human proteins.

The term "variant" (or "analog") refers to any polypeptide differing from naturally
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e.g.,
recombinant DNA techniques. Guidance in determining which amino acid residues may be
replaced, added or deleted without abolishing activities of interest, may be found by comparing
the sequence of the particular polypeptide with that of homologous peptides and minimizing the
number of amino acid sequence changes made in regions of high homology (conserved regions)
or by replacing amino acids with consensus sequence.

Alternatively, recombinant variants encoding these same or similar polypeptides may be
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon
substitutions, such as the silent changes which produce various restriction sites, may be
introduced to optimize cloning into a plasmid or viral vector or expression in a particular
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain
affinities, or degradation/turnover rate.
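
A silent change that produces a restriction site can be illustrated with a short sketch. It assumes the standard genetic code; the six-base input sequence, the choice of the EcoRI recognition site (GAATTC) and the function names are examples chosen for illustration and do not come from the Sequence Listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate whole codons of a sense-strand DNA sequence ('*' marks a stop)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def silent_variants(seq: str) -> list[str]:
    """All sequences that differ from seq in one codon yet encode the same protein."""
    variants = []
    usable = len(seq) - len(seq) % 3
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        for alt, aa in CODON_TABLE.items():
            if alt != codon and aa == CODON_TABLE[codon]:
                variants.append(seq[:i] + alt + seq[i + 3:])
    return variants


ECORI_SITE = "GAATTC"   # recognition sequence of the restriction enzyme EcoRI
original = "GAGTTC"     # hypothetical fragment encoding Glu-Phe

with_site = [v for v in silent_variants(original) if ECORI_SITE in v]
print(translate(original), with_site, [translate(v) for v in with_site])
```

Here the synonymous change GAG to GAA leaves the encoded Glu-Phe dipeptide unchanged while creating the GAATTC site.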

Preferably, amino acid "substitutions" are the result of replacing one amino acid with
another amino acid having similar structural and/or chemical properties, i.e., conservative amino
acid replacements. "Conservative" amino acid substitutions may be made on the basis of
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and
glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10
amino acids. The variation allowed may be experimentally determined by systematically making
insertions, deletions, or substitutions of amino acids in a polypeptide molecule using
recombinant DNA techniques and assaying the resulting recombinant variants for activity.

Alternatively, where alteration of function is desired, insertions, deletions or
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations
can, for example, alter one or more of the biological functions or biochemical characteristics of
the polypeptides of the invention. For example, such alterations may change polypeptide
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover
rate. Further, such alterations can be selected so as to generate polypeptides that are better suited
for expression, scale up and the like in the host cells chosen for expression. For example,
cysteine residues can be deleted or substituted with another amino acid residue in order to
eliminate disulfide bridges.
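
The four residue classes listed above for conservative substitutions can be expressed as a simple lookup. This is a minimal sketch using one-letter amino acid codes; the class labels and function names are hypothetical, and treating "same class" as the sole test of conservativeness is a simplification of the several criteria the text names.

```python
# Residue classes as listed in the text, in one-letter amino acid codes.
RESIDUE_CLASSES = {
    "nonpolar": set("ALIVPFWM"),      # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar neutral": set("GSTCYNQ"),  # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": set("RKH"),              # Arg, Lys, His
    "acidic": set("DE"),              # Asp, Glu
}


def residue_class(aa: str) -> str:
    """Return the class label for a one-letter amino acid code."""
    for label, members in RESIDUE_CLASSES.items():
        if aa.upper() in members:
            return label
    raise ValueError(f"unknown amino acid code: {aa!r}")


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues fall in the same class (simplified criterion)."""
    return residue_class(original) == residue_class(replacement)


print(is_conservative("L", "I"))  # leucine to isoleucine: same class
print(is_conservative("D", "K"))  # aspartic acid to lysine: different classes
print(is_conservative("C", "S"))  # cysteine to serine: same class in this grouping
```

Under this grouping, replacing cysteine with serine counts as conservative, although it still removes the residue's ability to form a disulfide bridge.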

The terms "purified" or "substantially purified" as used herein denote that the indicated
nucleic acid or polypeptide is present in the substantial absence of other biological
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more
preferably at least 99% by weight, of the indicated biological macromolecules present (but water,
buffers, and other small molecules, especially molecules having a molecular weight of less than
1000 daltons, can be present).

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from
at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or
polypeptides present in their natural source.

The term "recombinant," when used herein to refer to a polypeptide or protein, means
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian)
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in
bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial"
defines a polypeptide or protein essentially free of native endogenous substances and
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or
proteins expressed in yeast will have a glycosylation pattern in general different from those
expressed in mammalian cells.

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural
or coding sequence which is transcribed into mRNA and translated into protein, and (3)
appropriate transcription initiation and termination sequences. Structural units intended for use
in yeast or eukaryotic expression systems preferably include a leader sequence enabling
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant
protein is expressed without a leader or transport sequence, it may include an amino terminal
methionine residue. This residue may or may not be subsequently cleaved from the expressed
recombinant protein to provide a final product.

The term "recombinant expression system" means host cells which have stably integrated
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant
transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will
express heterologous polypeptides or proteins upon induction of the regulatory elements linked
to the DNA segment or synthetic gene to be expressed. This term also means host cells which
have stably integrated a recombinant genetic element or elements having a regulatory role in
gene expression, for example, promoters or enhancers. Recombinant expression systems as
defined herein will express polypeptides or proteins endogenous to the cell upon induction of the
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells
can be prokaryotic or eukaryotic.

The term "secreted" includes a protein that is transported across or through a membrane,
including transport as a result of signal sequences in its amino acid sequence when it is
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted
wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are
expressed. "Secreted" proteins also include without limitation proteins that are transported
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to
include proteins containing non-typical signal sequences (e.g., Interleukin-1 Beta, see Krasney,
P.A. and Young, P.R. (1992) Cytokine 4(2):134-143) and factors released from damaged cells
(e.g., Interleukin-1 Receptor Antagonist, see Arend, W.P. et al. (1998) Annu. Rev. Immunol.
16:27-55).

Where desired, an expression vector may be designed to contain a "signal or leader
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence
may be naturally present on the polypeptides of the present invention or provided from
heterologous protein sources by recombinant DNA techniques.

The term "stringent" is used to refer to conditions that are commonly understood in the
art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization
to filter-bound DNA in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at
65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e.,
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are
described herein in the examples.

In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for
14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20-base
oligonucleotides), and 60°C (for 23-base oligonucleotides).
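
By way of illustration only, the exemplary oligonucleotide wash temperatures recited above can be expressed as a simple lookup. The following Python sketch is not part of the disclosed methods; the function name `oligo_wash_temp_c` is hypothetical, and lengths not recited in the text are rejected rather than interpolated, since no rule for other lengths is given.

```python
# Wash temperatures (deg C) in 6X SSC/0.05% sodium pyrophosphate, keyed by
# oligonucleotide length in bases, exactly as recited in the text.
OLIGO_WASH_TEMP_C = {14: 37, 17: 48, 20: 55, 23: 60}


def oligo_wash_temp_c(length: int) -> int:
    """Return the exemplary stringent wash temperature for a recited length."""
    try:
        return OLIGO_WASH_TEMP_C[length]
    except KeyError:
        # The text gives no rule for other lengths, so none is assumed here.
        raise ValueError(
            f"no exemplary wash temperature recited for {length}-base oligonucleotides"
        )
```

For example, `oligo_wash_temp_c(17)` returns 48, while a length such as 18 raises an error.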

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid
sequences, for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse
functional dissimilarity between the reference and subject sequences. Typically, such a
substantially equivalent sequence varies from one of those listed herein by no more than about
35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a
substantially equivalent sequence, as compared to the corresponding reference sequence, divided
by the total number of residues in the substantially equivalent sequence is about 0.35 or less).
Such a sequence is said to have 65% sequence identity to the listed sequence. In one
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a
listed sequence by no more than 30% (70% sequence identity); in a variation of this
embodiment, by no more than 25% (75% sequence identity); and in a further variation of this
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this
embodiment, by no more than 5% (95% sequence identity). Substantially equivalent, e.g.,
mutant, amino acid sequences according to the invention preferably have at least 80% sequence
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent
nucleotide sequences of the invention can have lower percent sequence identities, taking into
account, for example, the redundancy or degeneracy of the genetic code. Preferably, a nucleotide
sequence has at least about 65% identity, more preferably at least about 75% identity, more
preferably at least about 80% sequence identity, more preferably at least about 85% sequence
identity, more preferably at least about 90% sequence identity, more preferably at least about
95% identity, more preferably at least about 98% sequence identity, and most preferably at least
about 99% sequence identity. For the purposes of the present invention, sequences having
substantially equivalent biological activity and substantially equivalent expression characteristics
are considered substantially equivalent. For the purposes of determining equivalence, truncation
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J.
(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by
other methods known in the art, e.g., by varying hybridization conditions.
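
The ratio described above (substitutions, additions, and deletions divided by the number of residues in the substantially equivalent sequence) can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned by some method (for example the Jotun Hein method mentioned above, which is not implemented here) and are supplied as equal-length strings using "-" for gaps. The function names and the gap symbol are assumptions made for illustration only.

```python
def variation_fraction(reference_aln: str, subject_aln: str, gap: str = "-") -> float:
    """Differences divided by residues in the subject (equivalent) sequence.

    Both arguments are rows of one pre-computed pairwise alignment.
    """
    if len(reference_aln) != len(subject_aln):
        raise ValueError("aligned sequences must have equal length")
    differences = 0
    for ref_res, sub_res in zip(reference_aln.upper(), subject_aln.upper()):
        if ref_res == gap and sub_res == gap:
            continue  # column carries no residue in either sequence
        if ref_res != sub_res:
            differences += 1  # substitution, addition, or deletion
    subject_residues = sum(1 for res in subject_aln if res != gap)
    if subject_residues == 0:
        raise ValueError("subject sequence contains no residues")
    return differences / subject_residues


def is_substantially_equivalent(reference_aln: str, subject_aln: str,
                                max_variation: float = 0.35) -> bool:
    """Apply the 'about 0.35 or less' sequence criterion from the text."""
    return variation_fraction(reference_aln, subject_aln) <= max_variation
```

For instance, a single substitution in a ten-residue subject sequence gives a variation of 0.10 (90% sequence identity). The sketch tests only the sequence criterion; the functional criteria described above are outside its scope.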

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell
types of an adult organism.

The term "transformation" means introducing DNA into a suitable host cell so that the
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether
or not any coding sequences are in fact expressed. The term "infection" refers to the introduction
of nucleic acids into a suitable host cell by use of a virus or viral vector.

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified
using known UMFs as a target sequence or target motif with the computer-based systems
described below. The presence and activity of a UMF can be confirmed by attaching the
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated
with an appropriate host under appropriate conditions and the uptake of the marker sequence is
determined. As described above, a UMF will increase the frequency of uptake of a linked
marker sequence.

Each of the above terms is meant to encompass all that is described for each, unless the
context dictates otherwise.

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the
nucleotide sequences of SEQ ID NO: 1-5497; a polynucleotide encoding any one of the peptide
sequences of SEQ ID NO: 5498-10994; and a polynucleotide comprising the nucleotide
sequence encoding the mature protein coding sequence of the polypeptides of any one of SEQ ID
NO: 5498-10994. The polynucleotides of the present invention also include, but are not limited
to, a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of
the nucleotide sequences of SEQ ID NO: 1-5497; (b) nucleotide sequences encoding any one of
the amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an
allelic variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species
homolog of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide
comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 5498-10994.
Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in
receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic
domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and
substrate binding domains; and domains in ligand polypeptides include receptor-binding
domains.

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed
herein. The corresponding genes can be isolated in accordance with known methods using the
sequence information disclosed herein. Such methods include the preparation of probes or primers
from the disclosed sequence information for identification and/or amplification of genes in
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that
corresponds to any of the polynucleotides of SEQ ID NO: 1-5497 can be obtained by screening
appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of
the polynucleotides of SEQ ID NO: 1-5497 or a portion thereof as a probe. Alternatively, the
polynucleotides of SEQ ID NO: 1-5497 may be used as the basis for suitable primer(s) that allow
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries.

The nucleic acid sequences of the invention can be assembled from ESTs and sequences
(including cDNA and genomic sequences) obtained from one or more public databases, such as
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information,
representative fragment or segment information, or novel segment information for the full-length
gene.

The polynucleotides of the invention also provide polynucleotides including nucleotide
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%,
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at
least about 95%, 96%, 97%, 98%, 99% sequence identity to a polynucleotide recited above.

Included within the scope of the nucleic acid sequences of the invention are nucleic acid
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences
of SEQ ID NO: 1-5497, or complements thereof, which fragment is greater than about 5
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most
preferably greater than 17 nucleotides. Fragments of, e.g., 15, 17, or 20 nucleotides or more that
are selective for (i.e., specifically hybridize to) any one of the polynucleotides of the invention
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in
the same family of genes or can differentiate human genes from genes of other species, and are
preferably based on unique nucleotide sequences.
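
As a rough illustration of basing probes on unique nucleotide sequences, the following Python sketch lists fragments of a chosen length from a target sequence that do not occur, by exact string match, in a set of related sequences. Exact matching is a simplifying assumption: actual specificity depends on the hybridization conditions, and near-identical stretches and complementary strands are not screened here. The function name `candidate_probes` and the default length of 17 are illustrative choices.

```python
from typing import List, Sequence


def candidate_probes(target: str, related: Sequence[str], k: int = 17) -> List[str]:
    """Return k-length fragments of `target` absent from every related sequence."""
    if k <= 0:
        raise ValueError("fragment length must be positive")
    target = target.upper()
    related_upper = [seq.upper() for seq in related]
    probes: List[str] = []
    seen = set()
    for start in range(len(target) - k + 1):
        fragment = target[start:start + k]
        if fragment in seen:
            continue  # report each distinct fragment once
        seen.add(fragment)
        # Keep the fragment only if no related sequence contains it exactly.
        if not any(fragment in other for other in related_upper):
            probes.append(fragment)
    return probes
```

Candidates returned by such a screen would still need to be checked experimentally under the stringent conditions defined above.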

The sequences falling within the scope of the present invention are not limited to these
specific sequences, but also include allelic and species variations thereof. Allelic and species
variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1-5497,
a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95%
identical, to SEQ ID NO: 1-5497 with a sequence from another isolate of the same species.
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules
coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words,
in the coding region of an ORF, substitution of one codon for another codon that encodes the same
amino acid is expressly contemplated.
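
The codon substitution contemplated above can be checked computationally. The Python sketch below translates two ORFs with the standard genetic code and reports whether they encode the same amino acid sequence. Use of the standard code (rather than, e.g., a mitochondrial code) is an assumption, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(orf: str) -> str:
    """Translate an in-frame ORF; '*' marks a stop codon."""
    orf = orf.upper().replace("U", "T")
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


def encode_same_protein(orf_a: str, orf_b: str) -> bool:
    """True if two ORFs differ, at most, by synonymous codon substitutions."""
    return translate(orf_a) == translate(orf_b)
```

For example, ATGGCTTTA and ATGGCCCTG differ at two codons yet both translate to Met-Ala-Leu, so `encode_same_protein` returns True for that pair.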

The nearest neighbor or homology result for the nucleic acids of the present invention,
including SEQ ID NO: 1-5497, can be obtained by searching a database using an algorithm or a
program. Preferably, BLAST, which stands for Basic Local Alignment Search Tool, is used to
search for local sequence alignments (Altschul, S.F., J. Mol. Evol. 36:290-300 (1993) and Altschul,
S.F. et al., J. Mol. Biol. 215:403-410 (1990)). Alternatively, a FASTA version 3 search against
Genpept, using the Fastxy algorithm, may be used.

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also
provided by the present invention. Species homologs may be isolated and identified by making
suitable probes or primers from the sequences provided herein and screening a suitable nucleic
acid source from the desired species.

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which
encode variants of the described nucleic acids. These amino acid sequence variants may be
prepared by methods known in the art by introducing appropriate nucleotide changes into a
native or variant polynucleotide. There are two variables in the construction of amino acid
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids
encoding the amino acid sequence variants are preferably constructed by mutating the


WO 01/79449 



PCT/US01/08656 


polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic
acid alterations can be made at sites that differ in the nucleic acids from different species
(variable positions) or in highly conserved regions (constant regions). Sites at such locations
will typically be modified in series, e.g., by substituting first with conservative choices (e.g.,
hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant
choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions
may be made at the target site. Amino acid sequence deletions generally range from about 1 to
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid
insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid
residues. Intrasequence insertions may range generally from about 1 to 10 amino acid residues,
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal
sequences necessary for secretion or for intracellular targeting in different host cells and
sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein.

In a preferred method, polynucleotides encoding the novel amino acid sequences are
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the
site being changed. In general, the techniques of site-directed mutagenesis are well known to
those of skill in the art and this technique is exemplified by publications such as Edelman et al.,
DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500
(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids.
When small amounts of template DNA are used as starting material, primer(s) that differ
slightly in sequence from the corresponding region in the template DNA can generate the desired
amino acid variant. PCR amplification results in a population of product DNA fragments that
differ from the polynucleotide template encoding the polypeptide at the position specified by the
primer. The product DNA fragments replace the corresponding region in the plasmid and this
gives a polynucleotide encoding the desired amino acid variant.
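
The structure of a mutagenic oligonucleotide as described above (an altered codon flanked on both sides by unchanged template sequence) can be sketched in Python as follows. The function name `mutagenic_oligo`, the zero-based codon index, and the default flank of 12 nucleotides are assumptions made for illustration; the text requires only "sufficient" adjacent nucleotides to form a stable duplex, and no melting-temperature calculation is attempted.

```python
def mutagenic_oligo(coding_seq: str, codon_index: int, new_codon: str,
                    flank: int = 12) -> str:
    """Return the sense-strand oligonucleotide carrying one codon change.

    codon_index is zero-based and counts codons from the start of coding_seq.
    """
    seq = coding_seq.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    start = codon_index * 3
    end = start + 3
    if codon_index < 0 or end > len(seq):
        raise ValueError("codon index lies outside the coding sequence")
    if start < flank or len(seq) - end < flank:
        raise ValueError("not enough flanking sequence on both sides of the codon")
    # Unchanged template on each side anchors the primer around the mismatch.
    return seq[start - flank:start] + new_codon + seq[end:end + flank]
```

For example, replacing the sixth codon (GAA) of ATGGCTAAAGGTCTGGAATTCCGTACCGGTTAA with GCA using a six-nucleotide flank yields GGTCTGGCATTCCGT.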

A further technique for generating amino acid variants is the cassette mutagenesis
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic
code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be used in the practice of the invention for the cloning and expression
of these novel nucleic acids. Such DNA sequences include those which are capable of
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions.

Polynucleotides encoding preferred polypeptide truncations of the invention can be used
to generate polynucleotides encoding chimeric or fusion proteins comprising one or more
domains of the invention and heterologous protein sequences.

The polynucleotides of the invention additionally include the complement of any of the
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known
to those of skill in the art and can include, for example, methods for determining hybridization
conditions that can routinely isolate polynucleotides of the desired sequence identities.

In accordance with the invention, polynucleotide sequences comprising the mature
protein coding sequences corresponding to any one of SEQ ID NO: 1-5497, or functional
equivalents thereof, may be used to generate recombinant DNA molecules that direct the
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also
included are the cDNA inserts of any of the clones identified herein.

A polynucleotide according to the invention can be joined to any of a variety of other
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al.
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g.,
plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the
art. Accordingly, the invention also provides a vector including a polynucleotide of the
invention and a host cell containing the polynucleotide. In general, the vector contains an origin
of replication functional in at least one organism, convenient restriction endonuclease sites, and a
selectable marker for the host cell. Vectors according to the invention include expression
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular
organism or part of a multicellular organism.

The present invention further provides recombinant constructs comprising a nucleic acid
having any of the nucleotide sequences of SEQ ID NO: 1-5497 or a fragment thereof or any
other polynucleotides of the invention. In one embodiment, the recombinant constructs of the
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid
having any of the nucleotide sequences of SEQ ID NO: 1-5497 or a fragment thereof is inserted,
in a forward or reverse orientation. In the case of a vector comprising one of the ORFs of the
present invention, the vector may further comprise regulatory sequences, including for example,
a promoter, operably linked to the ORF. Large numbers of suitable vectors and promoters are
known to those of skill in the art and are commercially available for generating the recombinant
constructs of the present invention. The following vectors are provided by way of example.
Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a,
pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia).
Eukaryotic: pWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL
(Pharmacia).

The isolated polynucleotide of the invention may be operably linked to an expression
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al.,
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many
suitable expression control sequences are known in the art. General methods of expressing
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in
Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the isolated
polynucleotide of the invention and an expression control sequence are situated within a vector
or cell in such a way that the protein is expressed by a host cell which has been transformed
(transfected) with the ligated polynucleotide/expression control sequence.

Promoter regions can be selected from any desired gene using CAT (chloramphenicol
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are
pKK232-8 and pCM7. Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt,
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine
kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of
the appropriate vector and promoter is well within the level of ordinary skill in the art.
Generally, recombinant expression vectors will include origins of replication and selectable
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli
and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct
transcription of a downstream structural sequence. Such promoters can be derived from operons
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), α-factor, acid
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is
assembled in appropriate phase with translation initiation and termination sequences, and
preferably, a leader sequence capable of directing secretion of translated protein into the
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a
fusion protein including an amino terminal identification peptide imparting desired
characteristics, e.g., stabilization or simplified purification of expressed recombinant product.
Useful expression vectors for bacterial use are constructed by inserting a structural DNA
sequence encoding a desired protein together with suitable translation initiation and termination
signals in operable reading phase with a functional promoter. The vector will comprise one or
more phenotypic selectable markers and an origin of replication to ensure maintenance of the
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be
employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for bacterial use
can comprise a selectable marker and bacterial origin of replication derived from commercially
available plasmids comprising genetic elements of the well known cloning vector pBR322
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These
pBR322 "backbone" sections are combined with an appropriate promoter and the structural
sequence to be expressed. Following transformation of a suitable host strain and growth of the
host strain to an appropriate cell density, the selected promoter is induced or derepressed by
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an
additional period. Cells are typically harvested by centrifugation, disrupted by physical or
chemical means, and the resulting crude extract retained for further purification.

Polynucleotides of the invention can also be used to induce immune responses. For
example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated herein by
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies
against the encoded polypeptide following topical administration of naked plasmid DNA or
following injection, and preferably intramuscular injection of the DNA. The nucleic acid
sequences are preferably inserted in a recombinant expression vector and may be in the form of
naked DNA.


4.3 ANTISENSE

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO: 1-5497, or fragments, analogs or derivatives thereof. An "antisense"
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid
molecules are provided that comprise a sequence complementary to at least about 10, 25, 50,
100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID
NO: 5498-10994 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID
NO: 1-5497 are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region"
of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers
to the region of the nucleotide sequence comprising codons which are translated into amino acid
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ
ID NO: 1-5497), antisense nucleic acids of the invention can be designed according to the rules
of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be
complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example,
the antisense oligonucleotide can be complementary to the region surrounding the translation
start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25,
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be
chemically synthesized using naturally occurring nucleotides or variously modified nucleotides
designed to increase the biological stability of the molecules or to increase the physical stability
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate
derivatives and acridine substituted nucleotides can be used.
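
Design of an antisense oligonucleotide by Watson-Crick complementarity to the region surrounding the translation start site can be illustrated as follows. This Python sketch handles unmodified DNA bases only and does not address Hoogsteen pairing or modified nucleotides. The function names and the default window (8 nucleotides upstream and 12 downstream of the first base of the start codon, giving a 20-mer) are assumptions made for illustration.

```python
# Watson-Crick complements for unmodified DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a sense sequence, written 5' to 3' as DNA."""
    return seq.upper().replace("U", "T").translate(_COMPLEMENT)[::-1]


def antisense_to_start_site(sense: str, atg_index: int,
                            upstream: int = 8, downstream: int = 12) -> str:
    """Antisense oligonucleotide spanning the translation start site.

    atg_index is the zero-based position of the A of the start codon in the
    sense (coding strand or mRNA) sequence.
    """
    sense = sense.upper().replace("U", "T")
    if atg_index < 0 or sense[atg_index:atg_index + 3] != "ATG":
        raise ValueError("atg_index does not point at an ATG start codon")
    begin = max(0, atg_index - upstream)
    end = min(len(sense), atg_index + downstream)
    return reverse_complement(sense[begin:end])
```

For example, for the sense sequence GGCACCATGGCTAAAGGT with the start codon at position 6, a window of 4 upstream and 8 downstream nucleotides gives the antisense oligonucleotide TTAGCCATGGTG.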

Examples of modified nucleotides that can be used to generate the antisense nucleic acid
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine,
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine,
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using an expression vector into which a
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest,
described further in the following subsection).

The antisense nucleic acid molecules of the invention are typically administered to a
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in
the major groove of the double helix. An example of a route of administration of antisense
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively,
antisense nucleic acid molecules can be modified to target selected cells and then administered
systemically. For example, for systemic administration, antisense molecules can be modified
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g.,
by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using
the vectors described herein. To achieve sufficient intracellular concentrations of antisense
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the
control of a strong pol II or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641). The
antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al.
(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987)
FEBS Lett 215: 327-330).


4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a
single-stranded nucleic acid, such as an mRNA, to which they have a complementary region.
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988)
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby inhibit
translation of an mRNA. A ribozyme having specificity for a nucleic acid of the invention can be
designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1-5497).
For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which
the nucleotide sequence of the active site is complementary to the nucleotide sequence to be
cleaved in an mRNA of SEQ ID NO: 1-5497 (see, e.g., Cech et al. U.S. Pat. No. 4,987,071; and
Cech et al. U.S. Pat. No. 5,116,742). Alternatively, polynucleotides of the invention can be used
to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules.
See, e.g., Bartel et al., (1993) Science 261:1411-1418.

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical
structures that prevent transcription of the gene in target cells. See generally, Helene (1991)
Anticancer Drug Des. 6: 569-84; Helene et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and
Maher (1992) Bioassays 14: 807-15.

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic
acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid
mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above;
Perry-O'Keefe et al. (1996) PNAS 93: 14670-675.

PNAs of the invention can be used in therapeutic and diagnostic applications. For
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication.
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a
gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in
combination with other enzymes, e.g., S1 nucleases (Hyrup B. (1996) above); or as probes or
primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996),
above).

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked
using linkers of appropriate lengths selected in terms of base stacking, number of bonds between
the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24:
3357-63. For example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA
and the 5' end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3'
DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem
Lett 5: 1119-11124.

In other embodiments, the oligonucleotide may include other appended groups such as
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the
cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556;
Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. WO88/09810) or
the blood-brain barrier (see, e.g., PCT Publication No. WO89/10134). In addition,
oligonucleotides can be modified with hybridization-triggered cleavage agents (see, e.g., Krol et
al., 1988, BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res.
5:539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a
peptide, a hybridization-triggered cross-linking agent, a transport agent, a hybridization-triggered
cleavage agent, etc.


4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the
polynucleotides of the invention. For example, such host cells may contain nucleic acids of the
invention introduced into the host cell using known transformation, transfection or infection
methods. The present invention still further provides host cells genetically engineered to express
the polynucleotides of the invention, wherein such polynucleotides are in operative association
with a regulatory sequence heterologous to the host cell which drives expression of the
polynucleotides in the cell.

Knowledge of nucleic acid sequences allows for modification of cells to permit, or
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous
recombination) to provide increased polypeptide expression by replacing, in whole or in part, the
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding
sequence, amplification of the marker DNA by standard selection methods results in
co-amplification of the desired protein coding sequences in the cells.

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by
calcium phosphate transfection, DEAE-dextran mediated transfection, or electroporation (Davis,
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the
polynucleotides of the invention can be used in conventional manners to produce the gene
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the present
invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, CV-1 cells,
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic hosts such as E. coli and B. subtilis.
The most preferred cells are those which do not normally express the particular polypeptide or
protein or which express the polypeptide or protein at a low natural level. Mature proteins can
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate
promoters. Cell-free translation systems can also be employed to produce such proteins using
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New
York (1989), the disclosure of which is hereby incorporated by reference.

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3
cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK,
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of
replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation
site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein
refolding steps can be used, as necessary, in completing configuration of the mature protein.
Finally, high performance liquid chromatography (HPLC) can be employed for final purification
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing
agents.

Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial
strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 
attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to
express an endogenous gene comprising the polynucleotides of the invention under the control of
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene
may be replaced by homologous recombination. As described herein, gene targeting can be used
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions,
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or
combinations of said sequences. Alternatively, sequences which affect the structure or stability
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by
targeting. These sequences include polyadenylation signals, mRNA stability elements, splice
sites, leader sequences for enhancing or modifying transport or secretion properties of the
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element.
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new
sequences are added. In all cases, the identification of the targeting event may be facilitated by
the use of one or more selectable marker genes that are contiguous with the targeting DNA,
allowing for the selection of cells in which the exogenous DNA has integrated into the host cell
genome. The identification of the targeting event may also be facilitated by the use of one or
more marker genes exhibiting the property of negative selection, such that the negatively
selectable marker is linked to the exogenous DNA, but configured such that the negatively
selectable marker flanks the targeting sequence, and such that a correct homologous
recombination event with sequences in the host cell genome does not result in the stable
integration of the negatively selectable marker. Markers useful for this purpose include the
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No.
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No.
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference
herein in its entirety. 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide
comprising: the amino acid sequences set forth as any one of SEQ ID NO: 5498-10994 or an
amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-5497 or the
corresponding full length or mature protein. Polypeptides of the invention also include
polypeptides preferably with biological or immunological activity that are encoded by: (a) a
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-5497 or
(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO:
5498-10994 or (c) polynucleotides that hybridize to the complement of the polynucleotides of
either (a) or (b) under stringent hybridization conditions. The invention also provides
biologically active or immunologically active variants of any of the amino acid sequences set
forth as SEQ ID NO: 5498-10994 or the corresponding full length or mature protein; and
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about
75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%,
92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or
most typically at least about 99% amino acid identity) that retain biological activity.
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity
compared to polypeptides comprising SEQ ID NO: 5498-10994. 
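
By way of illustration only, the amino acid identity thresholds recited above can be checked
with a short Python sketch. The function name `percent_identity` is hypothetical, and the
sketch assumes that the two sequences have already been aligned (gaps written as "-") by a
separate program. It counts identical residues over all aligned columns; other programs may
use a different denominator, so the value is not necessarily the same as that reported by any
particular alignment package.

```python
# Illustrative sketch only; the function name and the denominator convention
# (all aligned columns, excluding columns that are gaps in both sequences)
# are assumptions, not a definition taken from the disclosure.
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Made-up example: one substitution in ten aligned positions.
identity = percent_identity("MKTAYIAKQR", "MKTAYLAKQR")
meets_65_percent = identity >= 65.0
```

A candidate variant could be compared in this way against each recited threshold (65%, 70%,
and so on); whether the variant retains biological activity must still be determined separately.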

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described in H.
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer.
Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated herein by reference. Such
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example,
without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding
sequence is identified in the sequence listing by translation of the disclosed nucleotide
sequences. The mature form of such protein may be obtained by expression of a full-length
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form
of the protein is also determinable from the amino acid sequence of the full-length form. Where
proteins of the present invention are membrane bound, soluble forms of the proteins are also
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

Protein compositions of the present invention may further comprise an acceptable carrier, 
such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.

The present invention further provides isolated polypeptides encoded by the nucleic acid
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the
present invention. By "degenerate variant" is intended nucleotide fragments which differ from a
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic
acid fragments of the present invention are the ORFs that encode proteins.
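
By way of illustration only, the definition of a "degenerate variant" given above can be
expressed as a short Python sketch. The names `translate` and `is_degenerate_variant` and the
example sequences are hypothetical. The sketch assumes the standard genetic code, reading
frame 1, and input limited to the unambiguous bases A, C, G, T and U.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (first base varies slowest, third base fastest); "*" marks a stop codon.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(nucleotides: str) -> str:
    """Translate frame 1 of a DNA or RNA sequence; trailing partial codons are ignored."""
    seq = nucleotides.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def is_degenerate_variant(fragment: str, candidate: str) -> bool:
    """True if the two sequences differ at the nucleotide level but encode
    an identical polypeptide sequence."""
    a = fragment.upper().replace("U", "T")
    b = candidate.upper().replace("U", "T")
    return a != b and translate(a) == translate(b)


# Made-up example: both sequences encode Met-Ala-Lys.
same_protein = is_degenerate_variant("ATGGCTAAA", "ATGGCCAAG")
```

A sequence that changes even one encoded residue (for example, GCT to GGT, alanine to
glycine) would not qualify under this check.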

A variety of methodologies known in the art can be utilized to obtain any one of the
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid
sequence can be synthesized using commercially available peptide synthesizers. The
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary
structural and/or conformational characteristics with proteins may possess biological properties
in common therewith, including protein activity. This technique is particularly useful in
producing small peptides and fragments of larger polypeptides. Fragments are useful, for
example, in generating antibodies against the native polypeptide. Thus, they may be employed
as biologically active or immunological substitutes for natural, purified proteins in screening of
therapeutic compounds and in immunological processes for the development of antibodies.

The polypeptides and proteins of the present invention can alternatively be purified from 
cells which have been altered to express the desired polypeptide or protein. As used herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or
which the cell normally produces at a lower level. One skilled in the art can readily adapt
procedures for introducing and expressing either recombinant or synthetic sequences into
eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides
or proteins of the present invention.

The invention also relates to methods for producing a polypeptide comprising growing a
culture of host cells of the invention in a suitable culture medium, and purifying the protein from
the cells or the culture in which the cells are grown. For example, the methods of the invention
include a process for producing a polypeptide in which a host cell containing a suitable
expression vector that includes a polynucleotide of the invention is cultured under conditions that
allow expression of the encoded polypeptide. The polypeptide can be recovered from the
culture, conveniently from the culture medium, or from a lysate prepared from the host cells and
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known
methods for isolating polypeptides and proteins in order to obtain one of the isolated
polypeptides or proteins of the present invention. These include, but are not limited to,
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography,
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory
Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that
retain biological/immunological activity include fragments comprising greater than about 100
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

The purified polypeptides can be used in in vitro binding assays which are well known in 
the art to identify molecules which bind to the polypeptides. These molecules include, but are not
limited to, e.g., small molecules, molecules from combinatorial libraries, antibodies or other
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist
activity in in vivo tissue culture or animal models that are well known in the art. In brief, the
molecules are titrated into a plurality of cell cultures or animals and then tested for either
cell/animal death or prolonged survival of the animal/cells.

In addition, the peptides of the invention or molecules capable of binding to the peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NO: 5498-10994. 

The protein of the invention may also be expressed as a product of transgenic animals,
e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modifications are naturally provided or
deliberately engineered. For example, modifications in the peptide or DNA sequence can be
made by those skilled in the art using known techniques. Modifications of interest in the protein
sequences may include the alteration, substitution, replacement, insertion or deletion of a
selected amino acid residue in the coding sequence. For example, one or more of the cysteine
residues may be deleted or replaced with another amino acid to alter the conformation of the
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such
alteration, substitution, replacement, insertion or deletion retains the desired activity of the
protein. Regions of the protein that are important for the protein function can be determined by
various methods known in the art including the alanine-scanning method, which involves
systematic substitution of single or strings of amino acids with alanine, followed by testing the
resulting alanine-containing variant for biological activity. This type of analysis determines the 
importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 
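
By way of illustration only, the systematic substitution step of the alanine-scanning method
described above can be sketched in Python. The function name `alanine_scan`, its `window`
parameter, and the example sequence are hypothetical. The sketch only enumerates the variant
sequences; testing each variant for biological activity is an experimental step that the code
does not model.

```python
from typing import Iterator, Tuple


# Illustrative sketch only; names and the example sequence are hypothetical.
def alanine_scan(protein: str, window: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (1-based start position, variant sequence) pairs in which
    `window` consecutive residues are replaced with alanine ("A").
    Windows that already consist solely of alanine are skipped."""
    if window < 1:
        raise ValueError("window must be at least 1")
    for start in range(len(protein) - window + 1):
        segment = protein[start:start + window]
        if set(segment) == {"A"}:
            continue  # substitution would leave the sequence unchanged
        yield start + 1, protein[:start] + "A" * window + protein[start + window:]


# Made-up four-residue example: single-residue scan and a two-residue ("string") scan.
single_variants = list(alanine_scan("MKAC"))
double_variants = list(alanine_scan("MKAC", window=2))
```

Setting `window` to 1 corresponds to substitution of single amino acids, and larger values
correspond to substitution of strings of amino acids.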

Other fragments and derivatives of the sequences of proteins which would be expected to
retain protein activity in whole or in part and are useful for screening or other immunological
methodologies may also be easily made by those skilled in the art given the disclosures herein.
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A.
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and
Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present
invention is "transformed."

The protein of the invention may be prepared by culturing transformed host cells under 
culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known
purification processes, such as gel filtration and ion exchange chromatography. The purification
of the protein may also include an affinity column containing agents which will bind to the
protein; one or more column steps over such affinity resins as concanavalin A-agarose,
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form that will
facilitate purification. For example, it may be expressed as a fusion protein, such as those of
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a
His-tag. Kits for expression and purification of such fusion proteins are commercially available
from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen,
respectively. The protein can also be tagged with an epitope and subsequently purified by using
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP-HPLC)
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing
purification steps, in various combinations, can also be employed to provide a substantially 
homogeneous isolated recombinant protein. The protein thus purified is substantially free of 
other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein."

The polypeptides of the invention include analogs (variants). This embraces fragments,
as well as peptides in which one or more amino acids have been deleted, inserted, or substituted.
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or
modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to
another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs
may exhibit improved properties such as activity and/or stability. Examples of moieties which
may be fused to the polypeptide or an analog include, for example, targeting moieties which
provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells,
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well
as receptors and ligands expressed on pancreatic or immune cells. Other moieties which may be
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 
steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 


4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 
AND SIMILARITY 

Preferred methods to determine identity and/or similarity are designed to give the largest
match between the sequences tested. Methods to determine identity and similarity are codified in
computer programs including, but not limited to, the GCG program package, including GAP
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group,
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F.
et al., J. Molec. Biol. 215:403-410 (1990)), PSI-BLAST (Altschul, S.F. et al., Nucleic Acids Res.
vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp.
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software
(Nevill-Manning et al., ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software
(Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by
reference) and the Kyte-Doolittle hydrophobicity prediction algorithm (J. Mol. Biol., 157, pp.
105-31 (1982), incorporated herein by reference). The BLAST programs are publicly available
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST
Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol.
Biol. 215:403-410 (1990)).
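
By way of illustration only, the notion of percent identity over an alignment that gives the largest match between two sequences can be sketched as follows. This is a minimal Needleman-Wunsch style global alignment written for this example; it is not the GAP, BLAST or FASTA implementation, and the scoring values (match = 1, mismatch = -1, gap = -2) and the function names are assumptions chosen for the sketch rather than parameters taken from any of the cited programs.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Return one optimal global alignment of a and b as two gapped strings.

    Illustrative scoring only (assumed values, linear gap penalty).
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover the alignment.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            top.append(a[i - 1]); bottom.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1]); bottom.append("-"); i -= 1
        else:
            top.append("-"); bottom.append(b[j - 1]); j -= 1
    return "".join(reversed(top)), "".join(reversed(bottom))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of alignment length."""
    top, bottom = global_align(a, b)
    if not top:
        return 0.0
    matches = sum(1 for x, y in zip(top, bottom) if x == y and x != "-")
    return 100.0 * matches / len(top)
```

Under these assumed scores, aligning `ACGT` with `AGT` places a single gap opposite the `C`, giving three identical positions over an alignment of length four. Other conventions divide by the length of the shorter sequence instead, so any reported identity value should state which denominator was used.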

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to


WO 01/79449 PCT/US01/08656 
another polypeptide. Within a fusion protein the polypeptide according to the invention can
correspond to all or a portion of a protein according to the invention. In one embodiment, a
fusion protein comprises at least one biologically active portion of a protein according to the
invention. In another embodiment, a fusion protein comprises at least two biologically active
portions of a protein according to the invention. Within the fusion protein, the term "operatively
linked" is intended to indicate that the polypeptide according to the invention and the other
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or
C-terminus.

For example, in one embodiment a fusion protein comprises a polypeptide according to
the invention operably linked to the extracellular domain of a second protein.

In another embodiment, the fusion protein is a GST-fusion protein in which the
polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione
S-transferase) sequences.

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which
the polypeptide sequences according to the invention, comprising one or more domains, are fused
to sequences derived from a member of the immunoglobulin protein family. The
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical
compositions and administered to a subject to inhibit an interaction between a ligand and a
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo.
The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand.
Inhibition of the ligand/protein interaction may be useful therapeutically both for the treatment of
proliferative and differentiative disorders, e.g., cancer, and for modulating (e.g., promoting or
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be
used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand.

A chimeric or fusion protein of the invention can be produced by standard recombinant
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences
are ligated together in-frame in accordance with conventional techniques, e.g., by employing
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can
be synthesized by conventional techniques including automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that
give rise to complementary overhangs between two consecutive gene fragments that can
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for
example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley &
Sons, 1992). Moreover, many expression vectors are commercially available that already encode
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the
invention can be cloned into such an expression vector such that the fusion moiety is linked
in-frame to the protein of the invention.
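
The in-frame requirement described above can be illustrated with a short sketch. It assumes the two coding sequences are supplied as plain DNA strings that begin in reading frame, removes a terminal stop codon from the N-terminal partner, and rejects any part whose length is not a multiple of three or that would introduce a premature stop codon before the junction. The function names and the optional linker argument are hypothetical and serve only the example; the sketch is not a substitute for sequence verification of an actual construct.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(seq):
    """Split a DNA string into consecutive triplets (trailing bases dropped)."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def fuse_in_frame(n_term_cds, c_term_cds, linker=""):
    """Join two coding sequences so the C-terminal partner stays in frame.

    Hypothetical helper: assumes both inputs start at their first codon.
    """
    n_part = n_term_cds.upper()
    link = linker.upper()
    c_part = c_term_cds.upper()
    # The N-terminal partner must read through into the second partner.
    if n_part[-3:] in STOP_CODONS:
        n_part = n_part[:-3]
    for name, part in (("N-terminal", n_part), ("linker", link),
                       ("C-terminal", c_part)):
        if len(part) % 3 != 0:
            raise ValueError(f"{name} sequence length is not a multiple of 3")
    # A stop codon upstream of the junction would truncate the fusion.
    if any(codon in STOP_CODONS for codon in split_codons(n_part + link)):
        raise ValueError("premature stop codon upstream of the junction")
    return n_part + link + c_part
```

For instance, fusing `ATGGCCTAA` to `ATGAAATGA` drops the first partner's `TAA` and yields `ATGGCCATGAAATGA`, which reads as five uninterrupted codons ending in the second partner's stop codon.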

4.8 GENE THERAPY 

Mutations in a gene comprising the polynucleotides of the invention may result in loss of normal
function of the encoded protein. The invention thus provides gene therapy to restore normal
activity of the polypeptides of the invention, or to treat disease states involving polypeptides of
the invention. Delivery of a functional gene encoding polypeptides of the invention to
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example,
Anderson, Nature, supplement to vol. 392, no. 6679, pp. 25-30 (1998). For additional reviews of
gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of
the nucleotides of the present invention or a gene encoding the polypeptides of the present
invention can also be accomplished with extrachromosomal substrates (transient expression) or
artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of
proteins of the present invention in order to proliferate or to produce a desired effect on or
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes.
Alternatively, it is contemplated that in other human disease states, preventing the expression of
or inhibiting the activity of polypeptides of the invention will be useful in treating the disease
states. It is contemplated that antisense therapy or gene therapy could be applied to negatively
regulate the expression of polypeptides of the invention.

Other methods of inhibiting expression of a protein include the introduction of antisense
molecules to the nucleic acids of the present invention, their complements, or their translated RNA
sequences, by methods known in the art. Further, the polypeptides of the present invention can be
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such
as a silencer, which is tissue specific.
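
As a simple illustration of what an antisense molecule is at the sequence level, the following sketch derives a DNA oligonucleotide that is the reverse complement of a chosen window of a target sequence. The function name and the start/length parameters are hypothetical, and the sketch makes no attempt to address target accessibility, specificity or oligonucleotide chemistry, all of which would matter in practice.

```python
# Complement table; U in an RNA target pairs with A in the DNA oligonucleotide.
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def antisense_dna(target, start, length):
    """Return the DNA reverse complement of target[start:start + length].

    The target may be written as DNA (T) or RNA (U); the result is DNA,
    read 5' to 3'.
    """
    if start < 0 or length <= 0 or start + length > len(target):
        raise ValueError("window lies outside the target sequence")
    region = target[start:start + length]
    return region.translate(_COMPLEMENT)[::-1]
```

Applied to the first six bases of an mRNA written as `AUGGCCAAG`, the function returns `GGCCAT`, which pairs with `AUGGCC` when the two strands are aligned antiparallel.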

The present invention still further provides cells genetically engineered in vivo to express the
polynucleotides of the invention, wherein such polynucleotides are in operative association with a
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in
the cell. These methods can be used to increase or decrease the expression of the polynucleotides of
the present invention.

Knowledge of DNA sequences provided by the invention allows for modification of cells to
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g.,
by homologous recombination) to provide increased polypeptide expression by replacing, in whole
or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the
cells express the protein at higher levels. The heterologous promoter is inserted in such a manner
that it is operatively linked to the desired protein encoding sequences. See, for example, PCT
International Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and
PCT International Publication No. WO 91/09955. It is also contemplated that, in addition to
heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional
CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and
dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA. If
linked to the desired protein coding sequence, amplification of the marker DNA by standard
selection methods results in co-amplification of the desired protein coding sequences in the cells.

In another embodiment of the present invention, cells and tissues may be engineered to
express an endogenous gene comprising the polynucleotides of the invention under the control of
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may
be replaced by homologous recombination. As described herein, gene targeting can be used to
replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations
of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or
protein produced may be replaced, removed, added, or otherwise modified by targeting. These
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences
for enhancing or modifying transport or secretion properties of the protein, or other sequences
which alter or improve the function or stability of protein or RNA molecules.

The targeting event may be a simple insertion of the regulatory sequence, placing the gene
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the
targeting event may replace an existing element; for example, a tissue-specific enhancer can be
replaced by an enhancer that has broader or different cell-type specificity than the naturally
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are
added. In all cases, the identification of the targeting event may be facilitated by the use of one or
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the
targeting event may also be facilitated by the use of one or more marker genes exhibiting the
property of negative selection, such that the negatively selectable marker is linked to the exogenous
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and
such that a correct homologous recombination event with sequences in the host cell genome does
not result in the stable integration of the negatively selectable marker. Markers useful for this
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial
xanthine-guanine phosphoribosyl-transferase (gpt) gene.
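
The logic of combining a selectable marker with a negatively selectable marker can be summarized in a small sketch. It treats each cell as carrying, or not carrying, each marker after transfection and keeps only cells that have the selectable marker but have not stably integrated the negatively selectable one. The three outcome labels and the class and function names are assumptions made for illustration; in particular, the sketch idealizes random integration as always retaining the flanking negative marker.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Integration:
    """Marker status of a cell after transfection (idealized)."""
    positive_marker: bool   # selectable marker contiguous with the targeting DNA
    negative_marker: bool   # flanking negatively selectable marker, e.g. HSV TK


def survives_double_selection(cell):
    """Survive only with the positive marker and without the negative one."""
    return cell.positive_marker and not cell.negative_marker


# Idealized outcomes assumed for this sketch.
OUTCOMES = {
    "homologous recombination": Integration(True, False),
    "random integration": Integration(True, True),
    "no integration": Integration(False, False),
}


def surviving_outcomes():
    """Names of the outcomes that pass both selection steps."""
    return [name for name, cell in OUTCOMES.items()
            if survives_double_selection(cell)]
```

Under these assumptions only the correctly targeted cells remain after both selections, which is the enrichment the flanking arrangement of the negatively selectable marker is intended to provide.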

The gene targeting or gene activation techniques which can be used in accordance with this
aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel;
U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627
(WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436
(WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety.

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the
invention in vivo, one or more genes provided by the invention are either overexpressed or
inactivated in the germ line of animals using homologous recombination [Capecchi, Science
244:1288-1292 (1989)]. Animals in which the gene is overexpressed, under the regulatory
control of exogenous or endogenous promoter elements, are known as transgenic animals.
Animals in which an endogenous gene has been inactivated by homologous recombination are
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference.

Transgenic animals are useful to determine the roles polypeptides of the invention play in
biological processes, and preferably in disease states. Transgenic animals are useful as model
systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably
non-human mammals, are produced using methods as described in U.S. Patent No. 5,489,743 and
PCT Publication No. WO94/28122, incorporated herein by reference.

Transgenic animals can be prepared wherein all or part of a promoter of the
polynucleotides of the invention is either activated or inactivated to alter the level of expression
of the polypeptides of the invention. Inactivation can be carried out using homologous
recombination methods described above. Activation can be achieved by supplementing or even
replacing the homologous promoter to provide for increased protein expression. The
homologous promoter can be supplemented by insertion of one or more heterologous enhancer
elements known to confer promoter activation in a particular tissue.

The polynucleotides of the present invention also make possible the development,
through, e.g., homologous recombination or knock out strategies, of animals that fail to express
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as
models for studying the in vivo activities of polypeptide as well as for studying modulators of the
polypeptides of the invention.



4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or
more of the uses or biological activities (including those associated with assays cited herein)
identified herein. Uses or activities described for proteins of the present invention may be
provided by administration or use of such proteins or of polynucleotides encoding such proteins
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The
mechanism underlying the particular condition or pathology will dictate whether the
polypeptides of the invention, the polynucleotides of the invention or modulators (activators or
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic
compositions of the invention" include compositions comprising isolated polynucleotides
(including recombinant DNA molecules, cloned genes and degenerate variants thereof) or
polypeptides of the invention (including full length protein, mature protein and truncations or
domains thereof), or compounds and other substances that modulate the overall activity of the
target gene products, either at the level of target gene/protein expression or target protein
activity. Such modulators include polypeptides, analogs (variants), including fragments and
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple
helix formation; and in particular antibodies or other binding partners that specifically recognize
one or more epitopes of the polypeptides of the invention.

The polypeptides of the present invention may likewise be involved in cellular activation
or in one of the other physiological pathways described herein.

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research
community for various purposes. The polynucleotides can be used to express recombinant
protein for analysis, characterization or therapeutic use; as markers for tissues in which the
corresponding protein is preferentially expressed (either constitutively or at a particular stage of
tissue differentiation or development or in disease states); as molecular weight markers on gels;
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene
positions; to compare with endogenous DNA sequences in patients to identify potential genetic
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known
sequences in the process of discovering other novel polynucleotides; for selecting and making
oligomers for attachment to a "gene chip" or other support, including for examination of
expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for
example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of
the binding interaction.

The polypeptides provided by the present invention can similarly be used in assays to
determine biological activity, including in a panel of multiple proteins for high-throughput
screening; to raise antibodies or to elicit another immune response; as a reagent (including the
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or
development or in a disease state); and, of course, to isolate correlative receptors or ligands.
Proteins involved in these binding interactions can also be used to screen for peptide or small
molecule inhibitors or agonists of the binding interaction.

Any or all of these research utilities are capable of being developed into reagent grade or
kit format for commercialization as research products.

Methods for performing the uses listed above are well known to those skilled in the art.
References disclosing such methods include without limitation "Molecular Cloning: A
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987.

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional
sources or supplements. Such uses include without limitation use as a protein or amino acid
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a
particular organism or can be administered as a separate solid or liquid preparation, such as in the
form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the
polypeptide or polynucleotide of the invention can be added to the medium in or on which the
microorganism is cultured.

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting)
activity or may induce production of other cytokines in certain cell populations. A
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many
protein factors discovered to date, including all known cytokines, have exhibited activity in one
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient
confirmation of cytokine activity. The activity of therapeutic compositions of the present
invention is evidenced by any one of a number of routine factor-dependent cell proliferation
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3,
MC9/G, M+(preB M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e, CMK,
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following:

Assays for T-cell or thymocyte proliferation include without limitation those described
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E.
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3,
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol.
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli,
et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J. Immunol. 152:1756-1761, 1994.

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation,
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse
and human interleukin-γ, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells
include, without limitation, those described in: Measurement of Human and Murine Interleukin 2
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991;
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988;
Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse
and human interleukin 6, Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Acad. Sci.
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11, Bennett, F., Giannotti, J.,
Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp.
6.15.1, John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin
9, Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology.
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991.

Assays for T-cell clone responses to antigens (which will identify, among others, proteins
that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and
cytokine production) include, without limitation, those described in: Current Protocols in
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober,
Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7,
Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095,
1980; Weinberger et al., Eur. J. Immun. 11:405-411, 1981; Takai et al., J. Immunol.
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988.
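
Readouts from factor-dependent proliferation assays of the kind cited above are commonly summarized as a fold change over untreated control wells. The following sketch shows one such summary, assuming replicate numeric readouts (for example counts or absorbance values) are already available; the function name, the use of a simple mean, and the example values are assumptions for illustration and are not taken from any of the cited protocols.

```python
from statistics import mean


def stimulation_index(treated, control):
    """Mean readout of treated wells divided by mean readout of control wells.

    Hypothetical helper: values above 1 suggest induced proliferation and
    values below 1 suggest inhibition, subject to replicate variability.
    """
    if not treated or not control:
        raise ValueError("both treated and control replicates are required")
    baseline = mean(control)
    if baseline <= 0:
        raise ValueError("control readout must be positive")
    return mean(treated) / baseline
```

With illustrative replicate readouts of 300, 320 and 310 for treated wells and 100, 110 and 90 for control wells, the index is 3.1. A single ratio of means ignores replicate variance, so an actual analysis would normally add a statistical comparison and a dose series.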

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity and
be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or
ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential
state which would be useful for re-engineering damaged or diseased tissues, transplantation,
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce
large quantities of human cells has important working applications for the production of human
proteins which currently must be obtained from non-human sources or donors; implantation of
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases;
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs
for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.

It is contemplated that multiple different exogenous growth factors and/or cytokines may
be administered in combination with the polypeptide of the invention to achieve the desired
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and
specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand
(Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage
inflammatory protein 1-alpha (MIP-1-alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic
fibroblast growth factor (bFGF).

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of
these cells in culture will facilitate the production of large quantities of mature cells. Techniques
for culturing stem cells are known in the art and administration of polypeptides of the invention,
optionally with other growth factors and/or cytokines, is expected to enhance the survival and
proliferation of the stem cell populations. This can be accomplished by direct administration of
the polypeptide of the invention to the culture medium. Alternatively, stromal cells transfected
with a polynucleotide that encodes the polypeptide of the invention can be used as a feeder
layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926).

Stem cells themselves can be transfected with a polynucleotide of the invention to induce
autocrine expression of the polypeptide of the invention. This will allow for generation of
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be
differentiated into the desired mature cell types. These stable cell lines can also serve as a source
of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for
polymerase chain reaction experiments. These studies would allow for the isolation and
identification of differentially expressed genes in stem cell populations that regulate stem cell
proliferation and/or maintenance.

Expansion and maintenance of totipotent stem cell populations will be useful in the
treatment of many pathological conditions. For example, polypeptides of the present invention
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or
genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation
of neural cells and for the regeneration of nerve and brain tissue, i.e., for the treatment of central
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In
addition, the expanded stem cell populations can also be genetically altered for gene therapy
purposes and to decrease host rejection of replacement tissues after grafting or implantation.

Expression of the polypeptide of the invention and its effect on stem cells can also be
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell
types. A broadly applicable method of obtaining pure populations of a specific differentiated
cell type from undifferentiated stem cell populations involves the use of a cell-type specific
promoter driving a selectable marker. The selectable marker allows only cells of the desired type
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus
et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998))
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al.,
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be
accomplished by culturing the stem cells in the presence of a differentiation factor such as
retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the
effects of endogenous stem cell factor activity and allow differentiation to proceed.

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell
sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder
layer, as described by Thompson et al., Proc. Natl. Acad. Sci. U.S.A., 92: 7844-7848 (1995), in
the presence of the polypeptide of the invention alone or in combination with other growth
factors or cytokines. The ability of the polypeptide of the invention to induce stem cell
proliferation is determined by colony formation on semi-solid support, e.g., as described by
Bernstein et al., Blood, 77: 2316-2321 (1991).


4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 
cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 
monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complementary to platelet transfusions; and/or in supporting the growth and proliferation of
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 
those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following:

Suitable assays for proliferation and differentiation of various hematopoietic lines are
cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993.

Assays for stem cell survival and differentiation (which will identify, among others,
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in:
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I.
Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al.,
Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive hematopoietic colony forming cells
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et
al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 
Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming cells, 
stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 
inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention.

Another category of tissue regeneration activity that may involve the polypeptide of the
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or
other tissue formation in circumstances where such tissue is not normally formed has
application in the healing of tendon or ligament tears, deformities and other tendon or ligament
defects in humans and other animals. Such a preparation employing a tendon/ligament-like
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation
induced by a composition of the present invention contributes to the repair of congenital, trauma
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic
surgery for attachment or repair of tendons or ligaments. The compositions of the present
invention may provide an environment to attract tendon- or ligament-forming cells, stimulate
growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the
treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The
compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is
well known in the art.

The compositions of the present invention may also be useful for proliferation of neural
cells and for regeneration of nerve and brain tissue, i.e., for the treatment of central and peripheral
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a
composition may be used in the treatment of diseases of the peripheral nervous system, such as
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous
system diseases, such as Alzheimer's disease, Parkinson's disease, Huntington's disease, amyotrophic
lateral sclerosis, and Shy-Drager syndrome. Further conditions that may be treated in
accordance with the present invention include mechanical and traumatic disorders, such as spinal
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies
resulting from chemotherapy or other medical therapies may also be treatable using a
composition of the invention.

Compositions of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine,
kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the
desired effects may be by inhibition or modulation of fibrotic scarring to allow normal tissue
to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity.

A composition of the present invention may also be useful for gut protection or
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and
conditions resulting from systemic cytokine damage.

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above.

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
WO91/07491 (skin, endothelium).

Assays for wound healing activity include, without limitation, those described in: Winter,
Epidermal Wound Healing, pp. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol.
71:382-84 (1978).

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 
A polypeptide of the present invention may also exhibit immune stimulating or immune
suppressing activity, including without limitation the activities for which assays are described
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A
protein may be useful in the treatment of various immune deficiencies and disorders (including
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g.,
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More
specifically, infectious diseases caused by viral, bacterial, fungal or other infection may be
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses,
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful
where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer.

Autoimmune disorders which may be treated using a protein of the present invention
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes mellitus, myasthenia gravis, graft-versus-host
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof,
including antibodies) of the present invention may also be useful in the treatment of allergic
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, vernal
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animal
models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. Toxicol. 73: 501-9), and murine local lymph node assay (Kimber et al.,
J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune
responses in a number of ways. Down regulation may be in the form of inhibiting or blocking
an immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 
by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 
generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue
transplants, rejection of the transplant is initiated through its recognition as foreign by T cells,
followed by an immune reaction that destroys the transplant. The administration of a therapeutic
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells,
and thus act as an immunosuppressant. Moreover, a lack of costimulation may also be
sufficient to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
tolerance in a subject, it may also be necessary to block the function of a combination of B 
lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant
rejection or GVHD can be assessed using animal models that are predictive of efficacy in
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in
rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et
al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:11102-11105
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven
Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic
compositions of the invention on the development of that disease.


Blocking antigen function may also be therapeutically useful for treating autoimmune
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that
are reactive against self-tissue and which promote the production of cytokines and
autoantibodies involved in the pathology of the diseases. Preventing the activation of
autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents
which block stimulation of T cells can be used to inhibit T cell activation and prevent production
of autoantibodies or T cell-derived cytokines which may be involved in the disease process.
Additionally, blocking reagents may induce antigen-specific tolerance of autoreactive T cells
which could lead to long-term relief from the disease. The efficacy of blocking reagents in
preventing or alleviating autoimmune disorders can be determined using a number of
well-characterized animal models of human autoimmune diseases. Examples include murine
experimental autoimmune encephalitis, systemic lupus erythematosus in MRL/lpr/lpr mice or
NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB
rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven
Press, New York, 1989, pp. 840-856).

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means
of up regulating immune responses, may also be useful in therapy. Upregulation of immune
responses may be in the form of enhancing an existing immune response or eliciting an initial
immune response. For example, enhancing an immune response may be useful in cases of viral
infection, including systemic viral diseases such as influenza, the common cold, and
encephalitis.


Alternatively, anti-viral immune responses may be enhanced in an infected patient by
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed
APCs either expressing a peptide of the present invention or together with a stimulatory form of
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the
patient. Another method of enhancing anti-viral immune responses would be to isolate infected
cells from a patient, transfect them with a nucleic acid encoding a protein of the present
invention as described herein such that the cells express all or a portion of the protein on their
surface, and reintroduce the transfected cells into the patient. The infected cells would now be
capable of delivering a costimulatory signal to, and thereby activating, T cells in vivo.

A polypeptide of the present invention may provide the necessary stimulation signal to T
cells to induce a T cell mediated immune response against the transfected tumor cells. In
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to
reexpress sufficient amounts of MHC class I or MHC class II molecules, can be transfected with
nucleic acid encoding all or a portion (e.g., a cytoplasmic-domain truncated portion) of an
MHC class I alpha chain protein and β2 microglobulin protein or an MHC class II alpha chain
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II
proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding
an antisense construct which blocks expression of an MHC class II associated protein, such as
the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human
subject may be sufficient to overcome tumor-specific tolerance in the subject.

The activity of a protein of the invention may, among other means, be measured by the
following methods:

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation,
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D.
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19;
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J.
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J.
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al.,
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994.
Assays for T-cell-dependent immunoglobulin responses and isotype switching (which
will identify, among others, proteins that modulate T-cell dependent antibody responses and that
affect Th1/Th2 profiles) include, without limitation, those described in: Maliszewski, J.
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production,
Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins
that generate predominantly Th1 and CTL responses) include, without limitation, those described
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E.
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3,
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512,
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992.

Dendritic cell-dependent assays (which will identify, among others, proteins expressed
by dendritic cells that activate naive T-cells) include, without limitation, those described in:
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine
173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et
al., Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation
94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990.

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International
Journal of Oncology 1:639-648, 1992.

Assays for proteins that influence early steps of T-cell commitment and development
include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine et
al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al.,
Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991.


4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle
stimulating hormone (FSH), while activins are characterized by their ability to stimulate the
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention,
alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive
based on the ability of inhibins to decrease fertility in female mammals and decrease
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a
homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as
a fertility inducing therapeutic, based upon the ability of activin molecules to stimulate FSH
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A
polypeptide of the invention may also be useful for advancement of the onset of fertility in
sexually immature mammals, so as to increase the lifetime reproductive performance of domestic
animals such as, but not limited to, cows, sheep and pigs.

The activity of a polypeptide of the invention may, among other means, be measured by
the following methods.

Assays for activin/inhibin activity include, without limitation, those described in: Vale et 
al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature 
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci.
USA 83:3091-3095, 1986.

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils,
T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic
receptor activation can be used to mobilize or attract a desired cell population to a desired site of
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners,
or modulators of the invention) provide particular advantages in treatment of wounds and other
trauma to tissues, as well as in treatment of localized infections. For example, attraction of
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved
immune responses against the tumor or infecting agent.

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following:

Assays for chemotactic activity (which will identify proteins that induce or prevent
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell
population to another cell population. Suitable assays for movement and adhesion include,
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.
M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines
6.12.1-6.12.28); Taub et al., J. Clin. Invest. 95:1370-1376, 1995; Lind et al., APMIS 103:140-146,
1995; Muller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J. of Immunol. 152:5860-5867,
1994; Johnston et al., J. of Immunol. 153:1762-1768, 1994.

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostasis or thrombolysis or
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 
in treating wounds resulting from trauma, surgery or other causes. A composition of the 
invention may also be useful for dissolving or inhibiting formation of thromboses and for 
treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 
cardiac and central nervous system vessels (e.g., stroke)).

Therapeutic compositions of the invention can be used in the following: 
Assays for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 
45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 


4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 
cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 
kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Kaposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 
effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 
chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g., reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, 
Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, 
Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), 
Dacarbazine, Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate 
sodium, Etoposide (VP-16-213), Floxuridine, 5-Fluorouracil (5-FU), Flutamide, Hydroxyurea 
(hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate 
(LHRH-releasing factor analog), Lomustine, Mechlorethamine HCl (nitrogen mustard), 
Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), Mitomycin, Mitoxantrone HCl, 
Octreotide, Plicamycin, Procarbazine HCl, Streptozocin, Tamoxifen citrate, Thioguanine, 
Thiotepa, Vinblastine sulfate, Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, 
Interleukin-2, Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g., 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 
cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wiley-Liss, New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatti et al., Intl. J. Dev. Biol., 40: 1189-97 (1999) and Li et al., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cell lines are available, 
e.g., from American Type Culture Collection catalogs. 


4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 


WO 01/79449 



PCT/US01/08656 


and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands)) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may itself be useful as an inhibitor of receptor/ligand 
interactions. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 
Current Protocols in Immunology, Ed. by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 7.28, 
Measurement of Cellular Adhesion under static conditions 7.28.1-7.28.22), Takai et al., Proc. 
Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; 
Rosenstein et al., J. Exp. Med. 169:149-160, 1989; Stoltenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonists, antagonists, partial agonists or 
partial antagonists require the use of other proteins as competing ligands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
colorimetric molecules or toxin molecules by conventional methods ("Guide to Protein 
Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 
carbon-14. Examples of colorimetric molecules include, but are not limited to, fluorescent 
molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 
toxins include, but are not limited to, ricin. 




4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellularly. One method of drug screening 
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
be used for standard binding assays. One may measure, for example, the formation of 
complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 
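
The diminution in complex formation measured in a competitive binding assay is commonly expressed as a percent inhibition. The following Python sketch shows that arithmetic under stated assumptions: the signal values are arbitrary units from a hypothetical binding readout, and the background (non-specific) signal is subtracted from both measurements. None of these names or conventions is taken from the specification.

```python
def percent_inhibition(signal_with_compound, signal_without_compound, background=0.0):
    """Percent reduction in specific binding caused by a test compound.

    signal_with_compound:    binding signal measured in the presence of the compound
    signal_without_compound: binding signal measured with no compound (maximum binding)
    background:              non-specific signal subtracted from both (assumed constant)
    """
    total_specific = signal_without_compound - background
    if total_specific <= 0:
        raise ValueError("maximum specific binding must be positive")
    bound_specific = signal_with_compound - background
    return 100.0 * (1.0 - bound_specific / total_specific)
```

A compound that halves the specific signal gives 50 percent inhibition; values near zero indicate no competition for the polypeptide under the conditions tested.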

Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 
increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 
review, see Science 282:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 
oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 
Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
Al-Obeidi et al., Mol. Biotechnol. 9(3):205-23 (1998); Hruby et al., Curr. Opin. Chem. Biol., 
1(1):114-19 (1997); Dorner et al., Bioorg. Med. Chem., 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide, e.g., a 
ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 
to identify polynucleotides encoding binding partners. As another example, affinity 
chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 
that modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands, to two cell populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 
invention whereas the other does not. The responses of the two cell populations to the addition 
of ligand(s) are then compared. Alternatively, an expression library can be co-expressed with 
the polypeptide of the invention in cells and assayed for an autocrine response to identify 
potential ligand(s). As still another example, BIAcore assays, gel overlay assays, or other 
methods known in the art can be used to identify binding partner polypeptides, including, (1) 
organic and inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
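
The comparison of two otherwise identical cell populations described above can be sketched as a simple fold-change screen. In the following Python example the response values, the dictionary layout and the two-fold cut-off are all hypothetical choices made for illustration; the specification does not prescribe a readout or a threshold.

```python
def candidate_ligands(responses_expressing, responses_control, min_fold=2.0):
    """Return names of ligands whose response depends on the receptor.

    responses_expressing: ligand name -> response in receptor-expressing cells
    responses_control:    ligand name -> response in cells lacking the receptor
    min_fold:             assumed minimum expressing/control ratio to call a hit
    """
    hits = []
    for ligand, expressing in responses_expressing.items():
        control = responses_control[ligand]
        # Skip ligands with no measurable control response to avoid dividing by zero.
        if control > 0 and expressing / control >= min_fold:
            hits.append(ligand)
    return sorted(hits)
```

A ligand that produces the same response in both populations is acting through something other than the receptor of the invention, which is why only the ratio between the two populations is scored.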

The role of downstream intracellular signaling molecules in the signaling cascade of the 
polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 
the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications, i.e., phosphorylation. Other methods known to those in 
the art can also be used to identify signaling molecules involved in receptor activity. 

4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
with such activities can be used to treat inflammatory conditions (including chronic or acute 
conditions), including without limitation inflammation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or inflammation 
resulting from overproduction of cytokines such as TNF or IL-1. Compositions of the invention 
may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflammation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, or may be utilized as an 
antiproliferative agent, such as for acute or chronic myelogenous leukemia, or in the prevention 
of premature labor secondary to intrauterine infections. 


4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, 
myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 
(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 




4.10.17 NERVOUS SYSTEM DISORDERS 




Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
human and non-human mammalian patients) according to the invention include but are not 
limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
surgery, for example, lesions which sever a portion of the nervous system, or compression 
injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 
sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 
tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 
sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, transverse myelopathy of various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 
depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 
invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; affecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
affecting biorhythms or circadian cycles or rhythms; affecting the fertility of male or female 
subjects; affecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 
nutritional factors or component(s); affecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 
than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 
response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 
polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 
polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 
single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 
adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 
present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively, a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 
by an antibody specific to the variant sequence. 
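
Two of the approaches described above, sequencing an amplified fragment and restriction fragment length polymorphism analysis, can be illustrated in silico. The following Python sketch is a simplified illustration under stated assumptions: the sequences are short made-up strings, the reference and sample are already aligned and of equal length, and the cut is placed at the start of each recognition site rather than at the enzyme's true cleavage position.

```python
def single_base_differences(reference, sample):
    """List (position, reference_base, sample_base) for each mismatch (0-based positions)."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(reference.upper(), sample.upper())
    return [(i, r, s) for i, (r, s) in enumerate(pairs) if r != s]


def fragment_lengths(sequence, site):
    """Fragment lengths after cutting at the start of every occurrence of `site`."""
    seq, site = sequence.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Drop zero-length fragments that arise when a site sits at position 0.
    return [end - start for start, end in zip(bounds, bounds[1:]) if end > start]


# Made-up example: a single base change destroys a GAATTC recognition site.
reference = "AAGAATTCGG"
sample = "AAGAGTTCGG"
print(single_base_differences(reference, sample))
print(fragment_lengths(reference, "GAATTC"), fragment_lengths(sample, "GAATTC"))
```

The differing fragment patterns for the two sequences show why a polymorphism that creates or removes a recognition site is detectable by differential digestion.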


4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
arthritis are determined in an experimental animal model system. The experimental model 
system is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et al., 
1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 
23:129. Induction of the disease can be caused by a single injection, generally intradermally, of a 
suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered saline (PBS) at a dose of 
about 1-5 mg/kg. The control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of intradermally 
injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium in CFA, an overall arthritis score may be obtained as 
described by J. Holoshitz above. An analysis of the data would reveal that the test compound 
would have a dramatic effect on the swelling of the joints as measured by a decrease of the 
arthritis score. 
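
The dosing and scoring schedule described above can be written out explicitly. The Python sketch below lists the treatment days (day 0, then every other day until day 24), records the stated scoring days, and computes the mean difference in arthritis score between treated and PBS control animals. The function names and the day-to-score dictionary layout are assumptions for illustration, and no scores are supplied here because the specification reports none.

```python
# Scoring days stated in the protocol (days after injection of Mycobacterium in CFA).
SCORING_DAYS = [14, 15, 18, 20, 22, 24]


def treatment_days(last_day=24, interval=2):
    """Days on which the test compound is given: day 0, then every other day."""
    return list(range(0, last_day + 1, interval))


def mean_score_change(control_scores, treated_scores, days=SCORING_DAYS):
    """Mean (treated - control) arthritis score over the scoring days.

    Both arguments map day -> overall arthritis score. A negative result
    means the treated group scored lower (less joint swelling) on average.
    """
    diffs = [treated_scores[day] - control_scores[day] for day in days]
    return sum(diffs) / len(diffs)
```

Under this schedule the compound is given on thirteen occasions, and day 15 is the only scoring day that does not coincide with a treatment day.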


4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
include, but are not limited to, those exemplified herein. 


4.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 
administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
condition and response of the individual patient. Typically, the amount of polypeptide 
administered per dose will be in the range of about 0.01 µg/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0.1 µg/kg to 10 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 
with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 
The preparation of such solutions is within the skill of the art. 
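
The per-kilogram ranges given above translate into absolute amounts by simple multiplication with body weight. The following Python sketch shows only that arithmetic, with all amounts expressed in micrograms; the constant and function names are hypothetical, and the result is not a dosing recommendation, since the dosage is determined by the prescribing physician as stated above.

```python
UG_PER_MG = 1000.0

# Ranges from the text, converted to micrograms per kilogram of body weight.
BROAD_RANGE_UG_PER_KG = (0.01, 100 * UG_PER_MG)      # about 0.01 ug/kg to 100 mg/kg
PREFERRED_RANGE_UG_PER_KG = (0.1, 10 * UG_PER_MG)    # about 0.1 ug/kg to 10 mg/kg


def dose_bounds_ug(weight_kg, range_ug_per_kg=PREFERRED_RANGE_UG_PER_KG):
    """Return (lowest, highest) amount per dose in micrograms for a given body weight."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = range_ug_per_kg
    return (low * weight_kg, high * weight_kg)
```

For a 70 kg patient the preferred range spans roughly 7 µg to 700 mg per dose, which illustrates how wide the stated range is and why individual adjustment is expected.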


4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

A protein or other composition of the present invention (from whatever source derived, 
including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12,
IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth
factor (PDGF), transforming growth factors (TGF-α and TGF-β), insulin-like growth factor
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance
the activity of the protein or other active ingredient or complement its activity or use in
treatment. Such additional factors and/or agents may be included in the pharmaceutical
composition to produce a synergistic effect with protein or other active ingredient of the
invention, or to minimize side effects. Conversely, protein or other active ingredient of the
present invention may be included in formulations of the particular clotting factor, cytokine,
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or
anti-inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as
IL-1Ra, IL-1 Hy1, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or
complexes with itself or other proteins. As a result, pharmaceutical compositions of the
invention may comprise a protein of the invention in such multimeric or complexed form.

As an alternative to being included in a pharmaceutical composition of the invention
including a first protein, a second protein or a therapeutic agent may be concurrently
administered with the first protein (e.g., at the same time, or at differing times provided that
therapeutic concentrations of the combination of agents are achieved at the treatment site).
Techniques for formulation and administration of the compounds of the instant application may
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest
edition. A therapeutically effective dose further refers to that amount of the compound sufficient
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the
relevant medical condition, or an increase in rate of treatment, healing, prevention or
amelioration of such conditions. When applied to an individual active ingredient, administered
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a
combination, a therapeutically effective dose refers to combined amounts of the active
ingredients that result in the therapeutic effect, whether administered in combination, serially or
simultaneously.

In practicing the method of treatment or use of the present invention, a therapeutically
effective amount of protein or other active ingredient of the present invention is administered to
a mammal having a condition to be treated. Protein or other active ingredient of the present
invention may be administered in accordance with the method of the invention either alone or in
combination with other therapies such as treatments employing cytokines, lymphokines or other
hematopoietic factors. When co-administered with one or more cytokines, lymphokines or other
hematopoietic factors, protein or other active ingredient of the present invention may be
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic
factor(s), thrombolytic or antithrombotic factors, or sequentially. If administered sequentially,
the attending physician will decide on the appropriate sequence of administering protein or other
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other
hematopoietic factor(s), thrombolytic or antithrombotic factors.

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or
intestinal administration; parenteral delivery, including intramuscular, subcutaneous,
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous,
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active
ingredient of the present invention used in the pharmaceutical composition or to practice the
method of the present invention can be carried out in a variety of conventional ways, such as oral
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral
or intravenous injection. Intravenous administration to the patient is preferred.

Alternatively, one may administer the compound in a local rather than systemic manner, for
example, via injection of the compound directly into an arthritic joint or into fibrotic tissue, often
in a depot or sustained release formulation. In order to prevent the scarring process frequently
occurring as a complication of glaucoma surgery, the compounds may be administered topically,
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery
system, for example, in a liposome coated with a specific antibody, targeting, for example,
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the
afflicted tissue.

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit. 


4.12.2 COMPOSITIONS/FORMULATIONS

Pharmaceutical compositions for use in accordance with the present invention thus may
be formulated in a conventional manner using one or more physiologically acceptable carriers
comprising excipients and auxiliaries which facilitate processing of the active compounds into
preparations which can be used pharmaceutically. These pharmaceutical compositions may be
manufactured in a manner that is itself known, e.g., by means of conventional mixing,
dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or
lyophilizing processes. Proper formulation is dependent upon the route of administration
chosen. When a therapeutically effective amount of protein or other active ingredient of the
present invention is administered orally, protein or other active ingredient of the present
invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to
95% protein or other active ingredient of the present invention, and preferably from about 25 to
90% protein or other active ingredient of the present invention. When administered in liquid
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil,
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the
pharmaceutical composition may further contain physiological saline solution, dextrose or other
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol.
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to
90% by weight of protein or other active ingredient of the present invention, and preferably from
about 1 to 50% protein or other active ingredient of the present invention.

When a therapeutically effective amount of protein or other active ingredient of the
present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other
active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or
subcutaneous injection should contain, in addition to protein or other active ingredient of the
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection,
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or
other vehicle as known in the art. The pharmaceutical composition of the present invention may
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of
skill in the art. For injection, the agents of the invention may be formulated in aqueous
solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate
to the barrier to be permeated are used in the formulation. Such penetrants are generally known
in the art.

For oral administration, the compounds can be formulated readily by combining the
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules,
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be
treated. Pharmaceutical preparations for oral use can be obtained by combining the active
compound with a solid excipient, optionally grinding the resulting mixture, and processing the
mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores.
Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol,
or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch,
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic,
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be
added to the tablets or dragee coatings for identification or to characterize different combinations
of active compound doses.

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and,
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present
invention are conveniently delivered in the form of an aerosol spray presentation from
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g.,
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or
other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral
administration by injection, e.g., by bolus injection or continuous infusion. Formulations for
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with
an added preservative. The compositions may take such forms as suspensions, solutions or
emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending,
stabilizing and/or dispersing agents.

Pharmaceutical formulations for parenteral administration include aqueous solutions of
the active compounds in water-soluble form. Additionally, suspensions of the active compounds
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which
increase the solubility of the compounds to allow for the preparation of highly concentrated
solutions. Alternatively, the active ingredient may be in powder form for constitution with a
suitable vehicle, e.g., sterile pyrogen-free water, before use.

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt.

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v
polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic
administration. Naturally, the proportions of a co-solvent system may be varied considerably
without destroying its solubility and toxicity characteristics. Furthermore, the identity of the
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other
biocompatible polymers may replace polyethylene glycol, e.g., polyvinyl pyrrolidone; and other
sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity.
Additionally, the compounds may be delivered using a sustained-release system, such as
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent.
Various types of sustained-release materials have been established and are well known by those
skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the
biological stability of the therapeutic reagent, additional strategies for protein or other active
ingredient stabilization may be employed.
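
The VPD proportions described above (3% w/v benzyl alcohol, 8% w/v polysorbate 80 and 65% w/v polyethylene glycol 300 in absolute ethanol, then diluted 1:1 with 5% dextrose in water) can be worked through as a short calculation. The Python sketch below is illustrative only: the function and variable names are hypothetical, and the nominal VPD:5W concentrations assume that the two volumes are additive on mixing, which the text does not state.

```python
# Composition of VPD as stated in the text, in % w/v (grams per 100 mL).
VPD_PERCENT_WV = {
    "benzyl alcohol": 3.0,
    "polysorbate 80": 8.0,
    "polyethylene glycol 300": 65.0,
}
DEXTROSE_PERCENT_WV = 5.0  # the 5% dextrose-in-water diluent


def vpd_masses_g(volume_ml):
    """Grams of each solute for volume_ml of VPD (made up to volume in ethanol)."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    return {name: pct * volume_ml / 100.0 for name, pct in VPD_PERCENT_WV.items()}


def vpd_5w_percent_wv():
    """Nominal % w/v after 1:1 dilution, ASSUMING additive volumes."""
    final = {name: pct / 2.0 for name, pct in VPD_PERCENT_WV.items()}
    final["dextrose"] = DEXTROSE_PERCENT_WV / 2.0
    return final


if __name__ == "__main__":
    print(vpd_masses_g(100.0))   # grams per 100 mL of undiluted VPD
    print(vpd_5w_percent_wv())   # nominal VPD:5W concentrations
```

Under the additive-volume assumption, every VPD component is halved in VPD:5W and the dextrose concentration falls from 5% to 2.5% w/v.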

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers
or excipients. Examples of such carriers or excipients include but are not limited to calcium
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be
provided as salts with pharmaceutically compatible counter ions. Such pharmaceutically
acceptable base addition salts are those salts which retain the biological effectiveness and
properties of the free acids and which are obtained by reaction with inorganic or organic bases
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine,
monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and
the like.

The pharmaceutical composition of the invention may be in the form of a complex of the
protein(s) or other active ingredient(s) of the present invention along with protein or peptide
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following
presentation of the antigen by MHC proteins. MHC and structurally related proteins including
those encoded by class I and class II MHC genes on host cells will serve to present the peptide
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells.
Alternatively, antibodies able to bind surface immunoglobulin and other molecules on B cells as
well as antibodies able to bind the TCR and other molecules on T cells can be combined with the
pharmaceutical composition of the invention.

The pharmaceutical composition of the invention may be in the form of a liposome in
which protein of the present invention is combined, in addition to other pharmaceutically
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides,
sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S.
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated
herein by reference.

The amount of protein or other active ingredient of the present invention in the
pharmaceutical composition of the present invention will depend upon the nature and severity of
the condition being treated, and on the nature of prior treatments which the patient has
undergone. Ultimately, the attending physician will decide the amount of protein or other active
ingredient of the present invention with which to treat each individual patient. Initially, the
attending physician will administer low doses of protein or other active ingredient of the present
invention and observe the patient's response. Larger doses of protein or other active ingredient
of the present invention may be administered until the optimal therapeutic effect is obtained for
the patient, and at that point the dosage is not increased further. It is contemplated that the
various pharmaceutical compositions used to practice the method of the present invention should
contain about 0.01 µg to about 100 mg (preferably about 0.1 µg to about 10 mg, more preferably
about 0.1 µg to about 1 mg) of protein or other active ingredient of the present invention per kg
body weight.

For compositions of the present invention which are useful for bone, cartilage,
tendon or ligament regeneration, the therapeutic method includes administering the composition
topically, systemically, or locally as an implant or device. When administered, the therapeutic
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable
form. Further, the composition may desirably be encapsulated or injected in a viscous form for
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other
active ingredient of the invention, which may also optionally be included in the composition as
described above, may alternatively or additionally be administered simultaneously or
sequentially with the composition in the methods of the invention. Preferably for bone and/or
cartilage formation, the composition would include a matrix capable of delivering the
protein-containing or other active ingredient-containing composition to the site of bone and/or
cartilage damage, providing a structure for the developing bone and cartilage and optimally
capable of being resorbed into the body. Such matrices may be formed of materials presently in
use for other implanted medical applications.

The choice of matrix material is based on biocompatibility, biodegradability, mechanical
properties, cosmetic appearance and interface properties. The particular application of the
compositions will define the appropriate formulation. Potential matrices for the compositions
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate,
hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further
matrices are comprised of pure proteins or extracellular matrix components. Other potential
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass,
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the
above-mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and
tricalcium phosphate. The bioceramics may be altered in composition, such as in
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and
glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns.
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl
cellulose or autologous blood clot, to prevent the protein compositions from disassociating from
the matrix.

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose,
hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and
carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate,
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol).
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on
total formulation weight, which represents the amount necessary to prevent desorption of the
protein from the polymer matrix and to provide appropriate handling of the composition, yet not
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the
protein the opportunity to assist the osteogenic activity of the progenitor cells. In further
compositions, proteins or other active ingredients of the invention may be combined with other
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in
question. These agents include various growth factors such as epidermal growth factor (EGF),
platelet-derived growth factor (PDGF), transforming growth factors (TGF-α and TGF-β), and
insulin-like growth factor (IGF).

The therapeutic compositions are also presently valuable for veterinary applications.
In particular, domestic animals and thoroughbred horses, in addition to humans, are desired
patients for such treatment with proteins or other active ingredients of the present invention.

The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue
regeneration will be determined by the attending physician considering various factors which
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g.,
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution
and with inclusion of other proteins in the pharmaceutical composition. For example, the
addition of other known growth factors, such as IGF-I (insulin-like growth factor I), to the final
composition may also affect the dosage. Progress can be monitored by periodic assessment of
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and
tetracycline labeling.

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other known 
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

4.12.3 EFFECTIVE DOSAGE

Pharmaceutical compositions suitable for use in the present invention include
compositions wherein the active ingredients are contained in an effective amount to achieve their
intended purpose. More specifically, a therapeutically effective amount means an amount
effective to prevent development of or to alleviate the existing symptoms of the subject being
treated. Determination of the effective amount is well within the capability of those skilled in
the art, especially in light of the detailed disclosure provided herein. For any compound used in
the method of the invention, the therapeutically effective dose can be estimated initially from
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a
circulating concentration range that includes the IC50 as determined in cell culture (i.e., the
concentration of the test compound which achieves a half-maximal inhibition of the protein's
biological activity). Such information can be used to more accurately determine useful doses in
humans.

A therapeutically effective dose refers to that amount of the compound that results in
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell
cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the
population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred.
The data obtained from these cell culture assays and animal studies can be used in formulating a
range of dosage for use in humans. The dosage of such compounds lies preferably within a range
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may
vary within this range depending upon the dosage form employed and the route of administration
utilized. The exact formulation, route of administration and dosage can be chosen by the
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The
Pharmacological Basis of Therapeutics", Ch. 1, p. 1. Dosage amount and interval may be adjusted
individually to provide plasma levels of the active moiety which are sufficient to maintain the
desired effects, or minimal effective concentration (MEC). The MEC will vary for each
compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will
depend on individual characteristics and route of administration. However, HPLC assays or
bioassays can be used to determine plasma concentrations.
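
The therapeutic index described above, the ratio between LD50 and ED50, reduces to a single division. The Python sketch below illustrates it; the function name and the numeric values are hypothetical and are not data from this disclosure.

```python
def therapeutic_index(ld50, ed50):
    """Return LD50 / ED50; both values must be expressed in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("ld50 and ed50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical values (mg/kg) for two illustrative compounds.
    candidates = {"compound A": (500.0, 5.0), "compound B": (200.0, 20.0)}
    for name, (ld50, ed50) in candidates.items():
        print(f"{name}: therapeutic index = {therapeutic_index(ld50, ed50):g}")
```

A larger ratio means a wider margin between the therapeutically effective dose and the lethal dose, which is why compounds with high therapeutic indices are preferred.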

Dosage intervals can also be determined using the MEC value. Compounds should be 
administered using a regimen that maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 
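
The time-above-MEC criterion can be checked numerically once plasma concentrations have been measured. The sketch below is a minimal illustration that assumes a piecewise-linear concentration profile between sampling times; the function names, the sampling times and the concentration values are all hypothetical.

```python
def fraction_above_mec(times, concs, mec):
    """Fraction of the sampled interval during which the concentration,
    linearly interpolated between samples, is at or above the MEC."""
    if len(times) != len(concs) or len(times) < 2:
        raise ValueError("need at least two matching time/concentration samples")
    above = 0.0
    for i in range(len(times) - 1):
        t0, t1 = times[i], times[i + 1]
        c0, c1 = concs[i], concs[i + 1]
        dt = t1 - t0
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        if c0 >= mec and c1 >= mec:
            above += dt
        elif c0 < mec and c1 < mec:
            continue
        else:
            # The profile crosses the MEC once inside this segment.
            t_cross = t0 + dt * (mec - c0) / (c1 - c0)
            above += (t_cross - t0) if c0 >= mec else (t1 - t_cross)
    return above / (times[-1] - times[0])


def regimen_tier(fraction):
    """Map a time-above-MEC fraction onto the ranges given in the text."""
    if 0.5 <= fraction <= 0.9:
        return "most preferred (50-90%)"
    if 0.3 <= fraction <= 0.9:
        return "preferred (30-90%)"
    if 0.1 <= fraction <= 0.9:
        return "acceptable (10-90%)"
    return "outside 10-90%"


# Hypothetical 24 h profile: hours after dosing, concentration in ug/mL.
hours = [0, 2, 4, 8, 12, 24]
plasma = [0.0, 8.0, 10.0, 6.0, 4.0, 0.0]
frac = fraction_above_mec(hours, plasma, mec=4.0)
tier = regimen_tier(frac)
```

With these assumed samples the profile is at or above the MEC for 11 of 24 hours, which falls in the 30-90% range.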

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 µg/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0.1 µg/kg to 25 mg/kg of patient body weight daily, varying in adults and 
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 
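
For orientation only, the per-kilogram ranges above can be converted into absolute daily amounts for a given body weight. The sketch below reads the lower bounds as micrograms per kilogram; the body weights and all names are hypothetical, and the output is simple arithmetic, not a dosing recommendation.

```python
UG_PER_MG = 1000.0

# Daily ranges from the text, expressed as (low, high) in mg per kg.
EXEMPLARY_MG_PER_KG = (0.01 / UG_PER_MG, 100.0)
PREFERRED_MG_PER_KG = (0.1 / UG_PER_MG, 25.0)


def daily_dose_range_mg(weight_kg, per_kg_range):
    """Scale a (low, high) mg/kg range to absolute mg per day."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = per_kg_range
    return low * weight_kg, high * weight_kg


# Hypothetical body weights: a 70 kg adult and a 20 kg child.
adult_low, adult_high = daily_dose_range_mg(70.0, PREFERRED_MG_PER_KG)
child_low, child_high = daily_dose_range_mg(20.0, PREFERRED_MG_PER_KG)
```

Under these assumptions the preferred range corresponds to roughly 0.007 mg to 1750 mg per day for the adult and 0.002 mg to 500 mg per day for the child.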

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab, Fab' and F(ab')2 
fragments, and an Fab expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgG1, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or a 
portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of any of the full length proteins of the invention, and encompasses an epitope thereof such that 
an antibody raised against the peptide forms a specific immune complex with the full length 
protein or with any fragment that contains the epitope. Preferably, the antigenic peptide 
comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at least 20 amino 
acid residues, or at least 30 amino acid residues. Preferred epitopes encompassed by the 
antigenic peptide are regions of the protein that are located on its surface; commonly these are 
hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region on the surface of the protein of the invention, e.g., a hydrophilic 
region. A hydrophobicity analysis of the human related protein sequence will indicate which 
regions of a related protein are particularly hydrophilic and, therefore, are likely to encode 
surface residues useful for targeting antibody production. As a means for targeting antibody 
production, hydropathy plots showing regions of hydrophilicity and hydrophobicity may be 
generated by any method well known in the art, including, for example, the Kyte Doolittle or the 
Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and 
Woods, 1981, Proc. Natl. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol. Biol. 
157: 105-142, each of which is incorporated herein by reference in its entirety. Antibodies that 
are specific for one or more domains within an antigenic protein, or derivatives, fragments, 
analogs or homologs thereof, are also provided herein. 
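
A hydropathy profile of the kind referred to above can be sketched in a few lines. The example below is a minimal illustration that uses the published Kyte-Doolittle hydropathy values with a plain sliding-window average (no Fourier transformation). The window length of 9, the function names and the test sequence are assumptions made for illustration, not parameters taken from this disclosure.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Mean Kyte-Doolittle score for every window of the sequence."""
    sequence = sequence.upper()
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(sequence) - window + 1)
    ]


def most_hydrophilic_window(sequence, window=9):
    """Return (start index, peptide, mean score) of the lowest-scoring
    window, i.e. the most hydrophilic stretch and a candidate surface region."""
    profile = hydropathy_profile(sequence, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, sequence.upper()[start:start + window], profile[start]


# Hypothetical toy sequence: hydrophobic ends around a charged core.
start, peptide, score = most_hydrophilic_window("IIIIKKKKDDDDLLLL", window=4)
```

On the toy sequence the lowest-scoring window is the lysine stretch, which is the kind of hydrophilic region the text proposes as an antibody target. A real analysis would also consider the Hopp-Woods scale and structural information where available.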

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs, homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 
Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
such as affinity chromatography using protein A or protein G, which provide primarily the IgG 
fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 
target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen- 
binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 
Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103). Immortalized cell 
lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal 
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunosorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 
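
To make the affinity determination concrete, the sketch below applies the classical linear Scatchard transformation to equilibrium binding data: bound/free is plotted against bound, the slope of the fitted line is -1/Kd and the x-intercept is Bmax. This is a simplified single-site illustration and not the specific procedure of the cited reference; the function name and the synthetic data points are hypothetical.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free).

    Returns (kd, bmax) for a single class of binding sites, where
    slope = -1/kd and the x-intercept equals bmax.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two matching bound/free values")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    if slope >= 0:
        raise ValueError("a Scatchard plot for saturable binding must slope downward")
    intercept = mean_y - slope * mean_x
    return -1.0 / slope, -intercept / slope


# Synthetic single-site data generated with Kd = 2 nM and Bmax = 10 nM,
# using bound = Bmax * free / (Kd + free).
free_nM = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_nM = [10.0 * f / (2.0 + f) for f in free_nM]
kd, bmax = scatchard_fit(bound_nM, free_nM)
```

Because the synthetic points lie exactly on a single-site binding curve, the fit recovers the Kd and Bmax used to generate them. Real data are noisy, and nonlinear fitting of the untransformed binding curve is generally more robust than the linearized plot.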

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 


5.13.2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other 
antigen-binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., 
Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 
2:593-596 (1992)). 


5.13.3 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 
antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. 
(Nature 368, 856-859 (1994)); Morrison (Nature 368, 812-13 (1994)); Fishwild et al. (Nature 
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 
Lonberg and Huszar (Intern. Rev. Immunol. 13, 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 
endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 
provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 
cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 
expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 
5,939,598. It can be obtained by a method including deleting the J segment genes from at least 
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, 
the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 
and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 
contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed in 
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 
an expression vector containing a nucleotide sequence encoding a light chain into another 
mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 
antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 
immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 
WO 99/53049. 

5.13.4 Fab Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of single-chain 
antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 
monoclonal Fab fragments with the desired specificity for a protein or derivatives, fragments, 
analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F(ab')2 
fragment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment generated 
by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) Fv fragments. 


5.13.5 Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and in Traunecker et al., 1991 EMBO J., 10:3655-3659. 
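
The figure of ten molecules follows from simple counting, which the sketch below reproduces. It assumes that each antibody is an unordered pair of heavy-chain/light-chain half-molecules and that either light chain can associate with either heavy chain; the chain labels H1, H2, L1 and L2 are hypothetical names for the two parental specificities.

```python
from itertools import combinations_with_replacement, product

heavy_chains = ("H1", "H2")
light_chains = ("L1", "L2")

# Random assortment: any heavy chain may pair with any light chain,
# giving four possible half-molecules.
half_molecules = [h + l for h, l in product(heavy_chains, light_chains)]

# An antibody is an unordered pair of half-molecules (repeats allowed).
antibodies = list(combinations_with_replacement(half_molecules, 2))

# Only the pairing of each heavy chain with its cognate light chain,
# one arm per specificity, gives the desired bispecific molecule.
bispecific = [ab for ab in antibodies if set(ab) == {"H1L1", "H2L2"}]
```

Four half-molecules taken two at a time with repetition give ten distinct assemblies, of which exactly one carries both correctly paired arms, which is why purification of the desired product is needed.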

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CH1) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
al., Methods in Enzymology, 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment 
was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 
different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the VH and VL domains of one fragment are forced to pair with the complementary VL and VH 
domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for 
IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16) so as to focus cellular 
defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies can also 
be used to direct cytotoxic agents to cells which express a particular antigen. These antibodies 
possess an antigen-binding arm and an arm which binds a cytotoxic agent or a radionuclide 
chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest 
binds the protein antigen described herein and further binds tissue factor (TF). 

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and 
methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 
disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191-1195 
(1992) and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with 
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as 
described in Wolff et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can 
be engineered that has dual Fc regions and can thereby have enhanced complement lysis and 
ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins (PAPI, PAPII, and 
PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the trichothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 
212Bi, 131I, 131In, 90Y, and 186Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCl),
active esters (such as disuccinimidyl suberate), aldehydes (such as glutaraldehyde), bis-azido
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene 2,6-diisocyanate),
and bis-active fluorine compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987).
Carbon-14-labeled 1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid
(MX-DTPA) is an exemplary chelating agent for conjugation of radionuclide to the antibody. See
WO94/11026.

In another embodiment, the antibody can be conjugated to a "receptor" (such as
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 


4.14 COMPUTER READABLE SEQUENCES 
In one application of this embodiment, a nucleotide sequence of the present invention can
be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file,
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 
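
As an illustration of the two storage options named above (an ASCII file and a database application), the following is a minimal sketch only. It assumes a FASTA-style text layout and uses SQLite as a stand-in for the database products listed; the function names, the table layout, and the EXAMPLE_1 record are hypothetical and are not taken from the Sequence Listing.

```python
import os
import sqlite3
import tempfile

# Hypothetical placeholder record; not a sequence from the Sequence Listing.
EXAMPLE_RECORDS = [("EXAMPLE_1", "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG")]


def write_ascii_file(path, records, width=60):
    """Record (identifier, sequence) pairs in a FASTA-style ASCII text file."""
    with open(path, "w", encoding="ascii") as handle:
        for identifier, sequence in records:
            handle.write(f">{identifier}\n")
            for start in range(0, len(sequence), width):
                handle.write(sequence[start:start + width] + "\n")


def read_ascii_file(path):
    """Read the file back into a list of (identifier, sequence) pairs."""
    records, name, chunks = [], None, []
    with open(path, encoding="ascii") as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if name is not None:
                    records.append((name, "".join(chunks)))
                name, chunks = line[1:], []
            elif line:
                chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def store_in_database(connection, records):
    """Record the same pairs in a relational table keyed by identifier."""
    with connection:  # commits on success
        connection.execute(
            "CREATE TABLE IF NOT EXISTS sequences "
            "(id TEXT PRIMARY KEY, seq TEXT NOT NULL)"
        )
        connection.executemany("INSERT INTO sequences VALUES (?, ?)", records)


def fetch_sequence(connection, identifier):
    """Return the stored sequence for an identifier, or None if absent."""
    row = connection.execute(
        "SELECT seq FROM sequences WHERE id = ?", (identifier,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    folder = tempfile.mkdtemp()
    path = os.path.join(folder, "sequences.txt")
    write_ascii_file(path, EXAMPLE_RECORDS)
    print(read_ascii_file(path))
    db = sqlite3.connect(":memory:")
    store_in_database(db, EXAMPLE_RECORDS)
    print(fetch_sequence(db, "EXAMPLE_1"))
```

Either representation holds the same information; as the text notes, the choice would generally follow from how the stored sequence is to be accessed.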

By providing any of the nucleotide sequences SEQ ID NO: 1-5497 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 
sequences of SEQ ID NO: 1-5497 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp.
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 
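
To make the notion of identifying ORFs within a nucleic acid sequence concrete, the sketch below scans all six reading frames for stretches that begin with ATG and end at the first in-frame stop codon. It is a simple stand-in for illustration, not the BLAST or BLAZE software referred to above; the function names and the min_codons threshold are assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.translate(COMPLEMENT)[::-1]


def find_orfs(sequence, min_codons=3):
    """Return (strand, start, end, orf) tuples for ATG...stop reading frames.

    start and end are 0-based, end-exclusive offsets on the strand scanned.
    min_codons counts codons from ATG up to, but not including, the stop.
    """
    sequence = sequence.upper()
    results = []
    for strand, text in (("+", sequence), ("-", reverse_complement(sequence))):
        for frame in range(3):
            start = None
            for pos in range(frame, len(text) - 2, 3):
                codon = text[pos:pos + 3]
                if start is None and codon == "ATG":
                    start = pos  # first ATG opens a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    if (pos - start) // 3 >= min_codons:
                        results.append((strand, start, pos + 3, text[start:pos + 3]))
                    start = None  # resume looking for the next ATG
    return results


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from the Sequence Listing.
    for orf in find_orfs("CCATGGCCAAATAGCC"):
        print(orf)
```

A practical search would use a far larger minimum length and would compare candidate ORFs against known sequences, which is where the search software named in the text comes in.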
As used herein, "a computer-based system" refers to the hardware means, software
means, and data storage means used to analyze the nucleotide sequence information of the
present invention. The minimum hardware means of the computer-based systems of the present
invention comprises a central processing unit (CPU), input means, output means, and data
storage means. A skilled artisan can readily appreciate that any one of the currently available
computer-based systems is suitable for use in the present invention. As stated above, the
computer-based systems of the present invention comprise a data storage means having stored 
therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif. A variety of known algorithms are disclosed publicly, and a variety of commercially
available software for conducting search means is available and can be used in the computer-based
systems of the present invention. Examples of such software include, but are not limited to,
Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 
computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 
sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 
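
The statement that a longer target sequence is less likely to occur in the database at random can be illustrated with a back-of-the-envelope estimate. The sketch below assumes independent, equally likely residues, which real sequences do not satisfy, and the function name is hypothetical.

```python
def expected_random_matches(target_length, database_length, alphabet_size=4):
    """Expected number of exact chance matches of one target in a random database.

    Simplifying assumption: residues are independent and equally likely, so a
    given window matches with probability alphabet_size ** -target_length.
    Use alphabet_size=4 for nucleotides and 20 for amino acids.
    """
    if target_length < 1 or database_length < target_length:
        return 0.0
    windows = database_length - target_length + 1
    return windows / float(alphabet_size) ** target_length


if __name__ == "__main__":
    # A 6-nucleotide target versus a 30-nucleotide target in 10**9 bases.
    print(expected_random_matches(6, 10**9))
    print(expected_random_matches(30, 10**9))
```

Under these assumptions a six-nucleotide target is expected to turn up by chance many times in a large database, whereas a 30-nucleotide target is not, which is consistent with the preference for longer targets stated above.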

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding
sequences). 

4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to
control gene expression through triple helix formation or antisense DNA or RNA, both of which
methods are based on the binding of a polynucleotide sequence to DNA or RNA.
Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are
designed to be complementary to a region of the gene involved in transcription (triple helix - see
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan
et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem.
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press,
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA
molecule into polypeptide. Both techniques have been demonstrated to be effective in model
systems. Information contained in the sequences of the present invention is necessary for the
design of an antisense or triple helix oligonucleotide.
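
As a rough illustration of how sequence information feeds into antisense design, the sketch below derives a DNA oligonucleotide of 20 to 40 bases that is complementary to a chosen window of an mRNA. It covers only the base-pairing step; choice of target region, chemistry, and specificity checks are outside its scope, and the function name and example sequence are hypothetical.

```python
# Watson-Crick partners, written as DNA; U (RNA) and T both pair with A.
COMPLEMENT = str.maketrans("ACGUT", "TGCAA")


def antisense_oligo(mrna, start, length=20):
    """Return the DNA oligo (5'->3') complementary to mrna[start:start+length]."""
    if not 20 <= length <= 40:
        raise ValueError("length should be between 20 and 40 bases")
    target = mrna.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the sequence")
    # Complement each base, then reverse so the oligo reads 5'->3'.
    return target.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical toy mRNA, not taken from the Sequence Listing.
    print(antisense_oligo("AUGGCCAUUGUAAUGGGCCGCUGAAAGG", 0, 20))
```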

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated
with a suitable label.

In general, methods for detecting a polynucleotide of the invention can comprise
contacting a sample with a compound that binds to and forms a complex with the polynucleotide
for a period sufficient to form the complex, and detecting the complex, so that if a complex is
detected, a polynucleotide of the invention is detected in the sample. Such methods can also
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is
detected in the sample.

In general, methods for detecting a polypeptide of the invention can comprise contacting
a sample with a compound that binds to and forms a complex with the polypeptide for a period
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a
polypeptide of the invention is detected in the sample.
In detail, such methods comprise incubating a test sample with one or more of the
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well known in the art and can readily be adapted in order to obtain a
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 
provides a compartment kit to receive, in close confinement, one or more containers which
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in separate 
containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allow one to efficiently transfer reagents from one compartment to
another compartment such that the samples and reagents are not cross-contaminated, and the 
agents or solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 
contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 
probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 


4.17 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al., U.S. Pat. No. 5,413,778. Such methods involve chemical attachment of
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-5497, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, 
said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid.

In general, therefore, such methods for identifying compounds that bind to a
polynucleotide of the invention can comprise contacting a compound with a polynucleotide of
the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 
the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
invention for a time sufficient to form a polypeptide/compound complex, and detecting the 
complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 
polypeptide of the invention is identified.
Methods for identifying compounds that bind to a polypeptide of the invention can also
comprise contacting a compound with a polypeptide of the invention in a cell for a time
sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a
reporter gene sequence in the cell, and detecting the complex by detecting reporter gene
sequence expression, so that if a polypeptide/compound complex is detected, a compound that 
binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hruby et al., "Application of Synthetic Peptides: Antisense
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and
Kasprzak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as broadly 
described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 
ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have
base attachment capacity.

Agents suitable for use in these methods preferably contain 20 to 40 bases and are
designed to be complementary to a region of the gene involved in transcription (triple helix - see
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription
from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into
polypeptide. Both techniques have been demonstrated to be effective in model systems.
Information contained in the sequences of the present invention is necessary for the design of an
antisense or triple helix oligonucleotide and other DNA binding agents.

Agents that bind to a protein encoded by one of the ORFs of the present invention can be
used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the
present invention can be formulated using known techniques to generate a pharmaceutical
composition.

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid
hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The
hybridization probes of the subject invention may be derived from any of the nucleotide
sequences SEQ ID NO: 1-5497. Because the corresponding gene is only expressed in a limited
number of tissues, a hybridization probe derived from any of the nucleotide sequences SEQ
ID NO: 1-5497 can be used as an indicator of the presence of RNA of a cell type of such a tissue
in a sample.

Any suitable hybridization technique can be employed, such as, for example, in situ
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides
additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a
degenerate pool of possible sequences for identification of closely related genomic sequences.
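
A degenerate pool of possible sequences can be written compactly with the standard IUPAC nucleotide ambiguity codes and then expanded into its individual members. The sketch below shows that expansion; the function name and the example probe are hypothetical and are not drawn from the sequences of the invention.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(probe):
    """List every discrete sequence represented by a degenerate probe."""
    choices = (IUPAC[base] for base in probe.upper())
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # Hypothetical probe: N is any base, Y is C or T.
    pool = expand_degenerate("GCNAAY")
    print(len(pool), pool)
```

The pool size is the product of the number of choices at each position, so degeneracy grows quickly with the number of ambiguous positions.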

Other means for producing specific hybridization probes for nucleic acids include the
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors
are known in the art and are commercially available and may be used to synthesize RNA probes
in vitro by means of the addition of the appropriate RNA polymerase, such as T7 or SP6 RNA
polymerase, and the appropriate radioactively labeled nucleotides. The nucleotide sequences may
be used to construct hybridization probes for mapping their respective genomic sequences. The
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a
chromosome using well known genetic and/or chromosomal mapping techniques. These
techniques include in situ hybridization, linkage analysis against known chromosomal markers,
hybridization screening with libraries or flow-sorted chromosomal preparations specific to
known chromosomes, and the like. The technique of fluorescent in situ hybridization of
chromosome spreads has been described, among other places, in Verma et al. (1988) Human
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY.

Fluorescent in situ hybridization of chromosomal preparations and other physical
chromosome mapping techniques may be correlated with additional genetic map data. Examples
of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or
predisposition to a specific disease) may help delimit the region of DNA associated with that
genetic disease. The nucleotide sequences of the subject invention may be used to detect
differences in gene sequences between normal, carrier or affected individuals.

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for
example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced
using an automated oligonucleotide synthesizer.

Support bound oligonucleotides may be prepared by any of the methods known to those of
skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to
precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72);
using UV light (Nagata et al., 1985; Dahlen et al., 1987; Morrissey & Collins, (1989) Mol. Cell
Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al., 1988; 1989); all
references being specifically incorporated herein.

Another strategy that may be employed is the use of the strong biotin-streptavidin
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91(8) 3072-6,
describe the use of biotinylated probes, although these are duplex probes, that are immobilized on
streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal,
Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin.
Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies
(Alameda, CA).


WO 01/79449 




PCT/US01/08656 


Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc
Laboratories have developed a method by which DNA can be covalently bound to the microwell
surface termed CovaLink NH. CovaLink NH is a polystyrene surface grafted with secondary amino
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be
purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the
5'-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA
(Rasmussen et al., (1991) Anal. Biochem. 198(1) 138-42).

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has
been described (Rasmussen et al., (1991)). In this technology, a phosphoramidate bond is employed
(Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as immobilization using
only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the
CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to
CovaLink NH via a phosphoramidate bond, the oligonucleotide terminus must have a 5'-end
phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and
then streptavidin used to bind the probes.


More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1-methylimidazole,
pH 7.0 (1-MeIm7), is then added to a final concentration of 10 mM 1-MeIm7. A ss DNA solution is
then dispensed into CovaLink NH strips (75 ul/well) standing on ice.

Carbodiimide 0.2 M 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in
10 mM 1-MeIm7, is made fresh and 25 ul added per well. The strips are incubated for 5 hours at
50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed
3 times (wherein the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C).

It is contemplated that a further suitable method for use with the present invention is that
described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by
reference. This method of preparing an oligonucleotide bound to a support involves attaching a
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic
hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported
nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include
nucleoside phosphoramidite and nucleoside hydrogen phosphonate.

An on-chip strategy for the preparation of DNA probe arrays may be employed. For example,
addressable laser-activated photodeprotection may be
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by
Fodor et al. (1991) Science 251(4995) 767-73, incorporated herein by reference. Probes may also
be immobilized on nylon supports as described by Van Ness et al. (1991) Nucleic Acids Res.
19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem.
169(1) 104-8; all references being specifically incorporated herein.

To link an oligonucleotide to a nylon support, as described by Van Ness et al. (1991),
requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of
oligonucleotides with cyanuric chloride.

One particular way to prepare support bound oligonucleotides is to utilize the
light-generated synthesis described by Pease et al., (1994) PNAS USA 91(11) 5022-6, incorporated
herein by reference. These authors used current photolithographic techniques to generate arrays of
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct
the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile
5'-protected N-acyl-deoxynucleoside phosphoramidites, surface linker chemistry and versatile
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be
generated in this manner.

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA,
including mRNA without any amplification steps. For example, Sambrook et al. (1989) describes
three protocols for the isolation of high molecular weight DNA from mammalian cells (p.
9.14-9.23).

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples
may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA samples may be
prepared in 2-500 ml of final volume.

The nucleic acids would then be fragmented by any of the methods known to those of skill
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et
al. (1989), shearing by ultrasound and NaOH treatment.

Low pressure shearing is also appropriate, as described by Schriefer et al. (1990) Nucleic
Acids Res. 18(24) 7455-6, incorporated herein by reference. In this method, DNA samples are
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever
device allows controlled application of low to intermediate pressures to the cell. The results of
these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA
fragmentation methods.

One particularly suitable way for fragmenting DNA is contemplated to be that using the two
base recognition endonuclease, CviJI, described by Fitzgerald et al. (1992) Nucleic Acids Res.
20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation
of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and
sequencing.

The restriction endonuclease CviJI normally cleaves the recognition sequence PuGCPy
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of
this enzyme (CviJI**), yield a quasi-random distribution of DNA fragments from the small
molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated the
randomness of this fragmentation strategy, using a CviJI** digest of pUC19 that was size
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus
M13 cloning vector. Sequence analysis of 76 clones showed that CviJI** restricts PyGCPy and
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate
consistent with random fragmentation.
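
The site specificities described above can be illustrated with a short Python sketch. It is not taken from Fitzgerald et al.; the function names are hypothetical, the input is treated as a linear single-strand string, and a complete digest is assumed, whereas the CviJI** digests described above are partial.

```python
import re

# Normal CviJI site: PuGCPy (Pu = A or G, Py = C or T), cut between G and C.
NORMAL_SITE = re.compile(r"(?=([AG]GC[CT]))")
# Relaxed CviJI** activity as described in the text: PuGCPy, PyGCPy and PuGCPu,
# i.e. every NGCN site except PyGCPu.
RELAXED_SITE = re.compile(r"(?=([AG]GC[ACGT]|[CT]GC[CT]))")


def cut_positions(seq, pattern):
    """Return 0-based indices at which the blunt cut falls (between G and C)."""
    seq = seq.upper()
    # A match starts at the first base of the 4-base site; the cut lies
    # after the second base (the G), hence the offset of 2.
    return [m.start() + 2 for m in pattern.finditer(seq)]


def digest(seq, pattern):
    """Split a linear sequence at every cut position (assumed complete digest)."""
    bounds = [0] + cut_positions(seq, pattern) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]
```

The lookahead form of the patterns lets overlapping sites be reported individually. Comparing `digest(seq, NORMAL_SITE)` with `digest(seq, RELAXED_SITE)` on the same sequence shows how the relaxed specificity yields more, shorter fragments.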

As reported in the literature, advantages of this approach compared to sonication and
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 µg instead of 2-5
µg); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel
electrophoresis and elution are needed).

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 
important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the
chip. Phosphate groups must also be removed from genomic DNA by methods known in the art.

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane.
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an
array of wells in a microtiter plate) to repeatedly transfer about 20 nl of a DNA solution to a
nylon membrane. By offset printing, a density of dots higher than the density of the wells is
achieved. One to 25 dots may be accommodated in 1 mm², depending on the type of label used. By
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays)
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same
gene) from different individuals, or may be different, overlapped genomic clones. Each of the
subarrays may represent replica spotting of the same samples. In one example, a selected gene
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be
in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane.
Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the
dot span may be 1 mm² and there may be a 1 mm space between subarrays.
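
The geometry of the 64-patient example above can be checked with a short Python sketch. The arrangement of the 64 dots of each subarray as an 8 x 8 grid, the 8 x 12 pin layout, and all names below are assumptions made for illustration; the text states only the dot span, the spacing between subarrays, and the membrane size.

```python
PIN_ROWS, PIN_COLS = 8, 12   # assumed layout of the 96-pin device
PATIENTS = 64                # one plate, i.e. one sample, per patient
SUB_SIDE = 8                 # assumption: 64 dots arranged as an 8 x 8 grid
DOT_PITCH_MM = 1.0           # each dot occupies 1 mm^2
GAP_MM = 1.0                 # space between neighbouring subarrays
SUBARRAY_PITCH_MM = SUB_SIDE * DOT_PITCH_MM + GAP_MM


def dot_position(patient, pin_row, pin_col):
    """Centre (x, y) in mm of the dot left by one pin for one patient plate.

    patient is a 0-based index (0..63); offset printing shifts the whole
    pin array by one dot pitch per patient within each subarray.
    """
    off_row, off_col = divmod(patient, SUB_SIDE)
    x = pin_col * SUBARRAY_PITCH_MM + (off_col + 0.5) * DOT_PITCH_MM
    y = pin_row * SUBARRAY_PITCH_MM + (off_row + 0.5) * DOT_PITCH_MM
    return x, y


def membrane_extent():
    """Width and height in mm covered by all 96 subarrays."""
    width = PIN_COLS * SUBARRAY_PITCH_MM - GAP_MM
    height = PIN_ROWS * SUBARRAY_PITCH_MM - GAP_MM
    return width, height
```

Under these assumptions the subarray pitch works out to 9 mm, and the spotted area stays within the 8 x 12 cm membrane mentioned above.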

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois)
which may be partitioned by physical spacers, e.g., a plastic grid molded over the membrane, the grid
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage
screens or x-ray films.

The present invention is illustrated in the following examples. Upon consideration of the 
present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The
present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. Consequently, the only limitations which
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

5.0 EXAMPLES 

5.1 EXAMPLE 1

Novel Nucleic Acid Sequences Obtained From Various Libraries 
A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The
inserts of the library were amplified with PCR using primers specific for the vector sequences
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
screened with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems
(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Rapid 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5' direction. 


5.2 EXAMPLE 2 

Novel Contigs

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
from one or more public databases. The sequences for the resulting nucleic acid contigs are 
designated as SEQ ID NO: 1-5497 and are provided in the attached Sequence Listing. The contigs
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend
the seed EST into an extended assemblage, by pulling additional sequences from different databases
(i.e., Hyseq's database containing EST sequences, dbEST version 115, gbpri 115, and UniGene
version 103, and exons from public domain genomic sequences predicted by GenScan) that belong
to this assemblage. The algorithm terminated when there were no additional sequences from the
above databases that would extend the assemblage. Further, the inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST score 
greater than 300 and percent identity greater than 95%. 
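
The seed extension procedure described above may be sketched in Python as follows. This is only an illustration under stated assumptions: the actual algorithm is not disclosed beyond the description above, the loop is written iteratively rather than recursively, `blast_hit` is a hypothetical callable standing in for a BLASTN comparison, and only assemblage membership is tracked, not the merged consensus sequence.

```python
MIN_SCORE = 300        # BLAST score threshold stated in the text
MIN_IDENTITY = 95.0    # percent identity threshold stated in the text


def extend_assemblage(seed_id, candidate_ids, blast_hit):
    """Collect the ids of sequences that join the assemblage grown from a seed.

    candidate_ids: iterable of sequence ids drawn from the source databases.
    blast_hit:     hypothetical callable (member_ids, candidate_id) returning
                   (score, percent_identity) for the best hit of the candidate
                   against the current assemblage, or None if there is no hit.
    """
    candidate_ids = list(candidate_ids)
    members = {seed_id}
    extended = True
    while extended:                    # stop when a full pass adds nothing
        extended = False
        for cand_id in candidate_ids:
            if cand_id in members:
                continue
            hit = blast_hit(members, cand_id)
            if hit is None:
                continue
            score, identity = hit
            if score > MIN_SCORE and identity > MIN_IDENTITY:
                members.add(cand_id)
                extended = True
    return members
```

Repeating the pass until nothing new is added mirrors the stated termination condition: a sequence that does not match the seed itself may still be pulled in once an intermediate sequence has extended the assemblage.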

The novel predicted polypeptides (including proteins) encoded by the novel polynucleotides 
(SEQ ID NO: 1-5497) of the present invention are incorporated in the attached Sequence Listing. A
subset of the predicted polypeptide sequences contains an unknown amino acid; a stop codon; a
possible nucleotide deletion; or a possible nucleotide insertion. These sequences have also been
shown in their entirety in Table 2. Table 2 also shows the corresponding start and stop nucleotide
locations for each of SEQ ID NO: 1-5497. Table 2 also indicates the method by which the
polypeptide was predicted. Method A refers to a polypeptide obtained by using a software program
called FASTY (available from http://fasta.bioch.virginia.edu), which selects a polypeptide based on
a comparison of the translated novel polynucleotide to known polynucleotides (W.R. Pearson,
Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference). Method B refers to
a polypeptide obtained by using a software program called GenScan for human/vertebrate
sequences (available from Stanford University, Office of Technology Licensing) that predicts the
polypeptide based on a probabilistic model of gene structure/compositional properties (C. Burge
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers
to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 
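
The six-frame translation of Method C can be illustrated with a minimal Python sketch. The Hyseq program itself is proprietary and is not reproduced here; the sketch assumes the standard genetic code and, as a further assumption, treats an "open reading frame" as the longest stop-free stretch of a translated frame, without requiring an initiating methionine.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna, offset):
    """Translate one reading frame; codons with unknown bases become 'X'."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


def longest_orf(dna):
    """Return the longest stop-free peptide over all six reading frames."""
    dna = dna.upper()
    best = ""
    for strand in (dna, reverse_complement(dna)):   # forward, then reverse
        for offset in range(3):                      # three frames per strand
            for segment in translate(strand, offset).split("*"):
                if len(segment) > len(best):
                    best = segment
    return best
```

When two frames give stretches of equal length, this sketch keeps the first one found (forward frames before reverse frames); how the original program resolves such ties is not stated.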

The nearest neighbor results for SEQ ID NO: 1-5497 were obtained by a BLASTX
version 2.0a19MP-WashU search against Genpept release 122 and Geneseq release 200105
(Derwent), using the BLAST algorithm. The nearest neighbor result showed the closest homologue
for SEQ ID NO: 1-5497. The nearest neighbor results for SEQ ID NO: 1-5497 are incorporated
in the attached Sequence Listing.

Using the eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J.
Comp. Biol., Vol. 6 pp. 219-235 (1999) herein incorporated by reference), all the sequences were
examined to determine whether they had identifiable signature regions. The attached Sequence 
Listing provides the results obtained by eMatrix analysis for each polypeptide as follows: the 
signature region found in the indicated polypeptide sequences, the description of the signature, 
the eMatrix p-value(s) and the position(s) of the signature within the polypeptide sequence. 

Using the pFam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1)
pp. 320-322 (1998) herein incorporated by reference), all the polypeptide sequences were
examined for domains with homology to certain peptide domains. The attached Sequence
Listing provides the results obtained by pFam analysis for each polypeptide, namely: the name of
the domain found, the description, the p-value and the pFam score for the identified domain
within the sequence.

Tables 1 and 2 follow. Table 1 shows the various tissue sources of SEQ ID NO: 1-5497.
Table 2 shows the start and stop nucleotides for the translated amino acid sequence that each
assemblage encodes. Table 2 also provides a correlation between the amino acid sequences set forth
in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID
NO: in USSN 09/770,160.
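
The SEQ ID NOS column of Table 1 lists single numbers and hyphenated ranges, with some ranges wrapped across a line end (a trailing hyphen followed by the upper bound on the next line). A hedged Python sketch of how such a cell might be expanded into individual SEQ ID numbers follows; the function name is hypothetical and the parsing rules are inferred from the table layout only.

```python
import re


def parse_seq_ids(text):
    """Expand a Table 1 style cell into a sorted list of SEQ ID numbers."""
    # Rejoin ranges that were wrapped at a line end ("1017-" / "1021").
    joined = re.sub(r"-\s+", "-", text)
    ids = set()
    for token in joined.split():
        start, _, end = token.partition("-")
        first = int(start)                 # raises ValueError on damaged tokens
        last = int(end) if end else first
        if last < first:
            raise ValueError(f"descending range: {token!r}")
        ids.update(range(first, last + 1))
    return sorted(ids)
```

Because `int()` rejects anything that is not a plain number, tokens damaged during text extraction surface as errors rather than being silently skipped.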

Tissue
origin 

RNA 
Source 

Library 
Name 

SEQ ID NOS: 

adult 
brain 

GIBCO 

AB3001 

81-82 126 136 154-156 175-177 213-215 
278-283 346-349 445-446 459 491-492 543 
561-562 652-653 709-711 755-757 794-795 
822-823 899 924 971-988 995 997-998 1017- 
1021 1026-1027 1036-1037 1048 1085 1128 
1143 1154 1173 1202-1204 1269-1270 1290- 
1291 1300-1301 1320-1321 1353-1355 1357- 
1359 1363-1371 1388 1394-1396 1410 1415- 
1417 1422-1424 1426 1455-1456 1465-1470 
1508-1510 1533-1535 1541-1546 1550 1580- 
1581 1585 1588-1589 1592 1603-1608 1648 
1655 1663 1674-1682 1685 1709 1719-1721 
1723 1727-1734 1746 1753 1755-1756 1773- 
1774 1805-1806 1827-1829 1839-1847 1876- 
1877 1915-1918 1951 2005 2021-2024 2027- 
2034 2042-2043 2054 2057 2072-2074 2092 
2096-2097 2118 2144-2145 2177 2188-2190
2193-2195 2208-2210 2214-2215 2251-2252 
2281-2283 2288-2291 2294-2299 2331 2344 
2382 2417-2420 2422 2430 2437 2439-2441 
2446 2456 2483 2496 2499 2510-2513 2552 
2656 2686 2741-2743 2746-2747 2774-2778 
2783 2786 2842-2843 2857-2860 2865 2873- 
2874 2879-2881 2883-2884 2960-2962 2976- 
2977 3009 3136-3137 3139-3148 3167-3168 
3170-3171 3174 3198 3207 3213-3214 3220-
3222 3230 3240 3257-3259 3276-3277 3280- 
3282 3289-3290 3304-3307 3323-3324 3345- 
3346 3394-3395 3456 3477-3478 3536-3543 
3558-3562 3587 3689 3694-3696 3729-3730 
3737-3738 3772 3822-3825 3831-3833 3864- 
3865 3891 3963-3965 4001 4055-4056 4060- 
4061 4093 4098 4112-4113 4123 4125 4136-
4141 4230-4231 4273-4274 4291-4295 4520 
4546-4548 4569-4571 4575-4576 4691-4692 
4740-4741 4796-4797 4804-4805 4864-4865 
4900 4907-4909 5148-5149 5276-5277 5295- 
5296 5298-5302 5464-5466 

adult 
brain 

GIBCO 

ABD003 

1-11 52 64 81-82 123 154-156 175-177 233
248 258-260 278-283 313-315 335 339 354 
357-361 365 379-380 388-390 394 459 491- 
492 557 561-562 574-577 582 597-598 607 
652-653 670-671 677-678 682-684 719-722 
743-744 794-795 799-800 814-816 818 822- 
823 840-844 863-869 873-875 878 882-886 
889-897 909-914 916-920 924 927 930-936 
944-960 964-966 969 971-988 993-995 997- 
999 1008-1009 1017-1021 1023-1027 1036- 
1037 1042-1048 1050-1051 1053-1054 1063- 
1068 1070-1071 1075 1089-1091 1110-1113 
1117-1121 1128-1136 1143 1154 1156 1159- 
1164 1172 1175 1180-1184 1198-1204 1217- 
1218 1235-1236 1244-1246 1249-1255 1269- 
1273 1281-1282 1297 1300-1301 1307 1315- 
1316 1319-1322 1349-1350 1352-1355 1357- 
1358 1374 1388-1393 1398-1399 1410 1413 
1422-1424 1426 1438-1441 1446-1449 1451- 
1456 1463-1466 1473 1478-1479 1485 1498- 
1499 1507-1510 1516-1517 1532 1536-1539 
1541-1546 1551-1552 1559-1560 1580-1581 
1588-1589 1605-1608 1612 1620-1623 1639 
1648 1654 1661-1663 1665 1671-1673 1685 
1688-1690 1694-1699 1703-1704 1708-1709 
1715-1716 1719-1721 1723 1727 1737-1739 
1743-1746 1753-1756 1765-1769 1780-1783 
1805-1817 1831-1838 1845-1851 1860 1870- 
1875 1878 1900-1911 1915-1922 1926-1927 
1951-1962 1964-1965 1978-1979 1981-1983 
1990-1991 2000-2002 2005 2010-2013 2027- 
2030 2038 2042-2043 2048 2050-2051 2057-
2061 2066-2067 2072-2074 2083-2084 2086-
2087 2092-2093 2096-2102 2107 2115-2116
2118 2125-2130 2144 2146-2147 2177 2186- 
2188 2214-2215 2223 2230-2232 2251-2252 
2254 2258-2260 2267-2270 2273 2281-2282 
2284 2288-2291 2296-2299 2310 2318-2320 
2324 2331 2333-2334 2377-2382 2389-2390 
2403-2404 2416-2417 2419-2424 2430 2439-
2441 2444 2446-2447 2467 2469 2475 2483 
2488 2499 2510-2513 2536-2538 2573-2575 
2592 2594-2595 2597 2603-2604 2628-2632 
2644-2648 2656 2666 2668 2672-2674 2677- 
2680 2686 2696-2697 2726 2734 2745-2748 
2751 2760 2763-2764 2768-2771 2777-2778 
2780-2783 2786 2805-2806 2814 2820 2824-
2826 2828 2836-2839 2843 2854 2857-2860 
2865 2894-2897 2906 2914-2917 2925-2929 
2954 2960 2964 2969-2973 2996-2998 3009 
3035-3036 3054 3084-3085 3088-3089 3094- 
3095 3100 3110 3133-3135 3139-3148 3151- 
3152 3158 3167-3168 3170-3173 3189-3191 
3195 3199 3203-3204 3213-3214 3219 3223 
3226-3228 3230-3233 3253-3255 3257-3259 
3276-3277 3280-3282 3288-3290 3310-3311 
3313 3323-3324 3331-3332 3339-3340 3345- 
3346 3372-3373 3409-3417 3442 3477-3478 
3491-3495 3505-3506 3536-3543 3554-3556 
3558-3560 3576 3587-3589 3599-3601 3628- 


WO 01/79449 


PCT/US01/08656


3630 3638-3640 3693 3729-3730 3810 3815- 
3821 3830-3833 3867-3869 3882-3883 3911- 
3919 3941-3942 3955-3957 3959-3960 3962 
4014-4015 4040-4045 4055-4056 4062-4064 
4076 4087-4093 4101-4102 4112-4116 4123 
4125 4127-4130 4136-4137 4140-4141 4241- 
4242 4252-4253 4285-4286 4291-4295 4335- 
4337 4439 4546-4549 4552 4570-4571 4598- 
4601 4616-4621 4652-4653 4679 4691-4692 
4740-4741 4764-4766 4796-4797 4807-4808 
4862-4863 4907-4909 4971 4976-4979 5021- 
5023 5137-5140 5177-5179 5181-5183 5276- 
5277 5284-5289 5291 5295-5296 5308-5309 
5389-5391 5399-5401 5421-5422 5427-5429 
5443-5444 5464-5466 5485 5490-5491 

adult 
brain 

Clontech 

ABR001 

3-11 70 137 175-177 478 491-492 597-598 
755-757 796 852-854 910-914 964-966 1026- 
1027 1049 1158-1159 1173 1198-1201 1271
1274 1281-1282 1297 1351 1363-1371 1389 
1479 1671-1673 1685 1719-1721 1784-1790 
1870-1875 1900-1902 1919-1922 1926-1927 
2035 2072-2074 2224-2228 2298-2299 2305- 
2306 2404 2419-2420 2436 2528-2531 2644 
2713-2714 2751 2762 2774-2776 2786 2978 
3113 3151-3152 3181-3183 3213-3214 3424- 
3427 3554-3556 3577-3582 3587 3595-3596 
3638-3640 3663 3742-3744 3853 3911-3919
3931 3941-3942 3962 4036-4039 4077-4079 
4125 4220-4223 4320 4545 4549 4570-4571 
4672-4674 4738-4739 4764-4766 4781 4815 
4910 5001-5003 5435-5437 5464-5466 

adult 
brain 

Clontech 

ABR006 

47 126 130 154-156 278-283 395 561-562 
583-590 661-662 709-711 855-856 889-897 
903-905 909 945 961-962 1063-1067 1069 
1088 1095 1154-1155 1235-1236 1281-1282 
1349-1350 1360-1362 1394 1418-1420 1580- 
1584 1626 1634-1637 1671-1673 1688-1689 
1694-1698 1715-1716 1728-1734 1763-1764 
1770-1771 1773-1774 1839-1844 1903-1911 
1913-1914 2027-2030 2035 2054-2056 2076- 
2077 2121-2124 2145 2163-2168 2188 2197- 
2199 2214-2215 2445 2591-2592 2598 2650 
2686 2737-2738 2745 2774-2778 2857-2860 
3323-3324 3328-3330 3342-3344 3354 3396- 
3398 3498-3501 3536-3543 3658-3660 3856- 
3857 4300-4308 4379-4380 4410-4412 4451- 
4452 4481-4489 4549 4624-4626 4660 4824- 
4826 4832-4834 4967-4970 5050-5052 5278- 
5279 

adult
brain

Clontech

ABR008

30-31 39-40 45-46 62 74-77 81-82 116-119
129-130 136-137 143-148 154-156 175-177 
187-190 195-196 216-218 227-230 254 258- 
260 294-295 301-303 313-315 340 388-390 
395-398 400-404 407-410 413-414 435 449 
465 493-495 499 509 520-521 531-532 545 
557-560 579-581 592-594 602-603 607 611 
629-630 647-649 652-653 659-660 675-676 
685 697-698 709-711 747-750 758-759 796 
804-808 817-818 829-831 836-837 840-844 
885-886 903-905 909-914 916-918 924 944 
946-949 957-960 971-974 993-994 997-1002 
1017-1021 1028 1038-1040 1042-1049 1053- 
1054 1070-1072 1076 1095 1117-1119 1128- 
1136 1139-1143 1151-1154 1160-1164 1175 
1182-1184 1192 1202-1204 1222-1223 1228 
1230-1232 1235-1236 1271 1278-1282 1285 
1294-1296 1320-1321 1323-1327 1349-1350 
1353-1354 1357-1359 1380 1383-1384 1386- 
1387 1389 1398-1401 1403-1404 1407 1411 
1421-1423 1426-1432 1446-1449 1451-1456 
1463-1464 1479-1480 1485 1488 1491-1494 
1508-1510 1527-1538 1547-1548 1557 1580- 
1584 1605-1608 1629-1632 1634-1638 1640- 
1645 1648 1667-1670 1685 1691-1692 1694- 
1698 1701-1704 1706-1709 1715-1716 1724 
1727 1737-1739 1742-1746 1754 1765-1769 
1773-1774 1780-1783 1796-1817 1827-1829 
1839-1844 1848-1851 1870-1875 1879-1885 
1896-1897 1900-1911 1915-1922 1926-1927 
1950 1952-1962 1966-1974 1978-1979 1981- 
1983 1990-1991 2005-2007 2010-2013 2017-
2020 2025-2030 2040-2041 2044 2048 2052- 
2053 2055-2056 2058-2059 2062-2064 2068- 
2074 2076-2079 2095 2118 2134-2136 2138- 
2142 2144-2147 2161-2162 2174-2177 2186- 
2188 2191-2199 2204-2215 2223-2233 2254- 
2257 2281-2282 2286-2291 2347-2356 2362 
2380-2381 2419-2420 2437 2456 2464 2496 
2511-2513 2534-2536 2548 2554-2556 2592 
2596 2603-2604 2626 2629-2631 2633-2637 
2645-2647 2650-2655 2657-2658 2665 2669- 
2671 2675-2680 2696-2697 2702-2705 2709- 
2711 2728-2729 2749-2750 2762 2777-2778
2784 2828-2829 2843 2846-2850 2857-2862
2865 2869-2870 2885-2888 2904 2925-2929 
2931-2939 2945-2946 2955 2969-2973 3084- 
3085 3118 3136-3137 3172-3173 3196 3208- 
3209 3213-3216 3219 3229-3230 3234 3240 
3243 3304-3307 3312 3331-3332 3342-3346 
3371 3403-3404 3406-3407 3424-3427 3444-
3445 3466-3468 3477-3478 3498-3504 3514 
3516-3517 3535 3554-3556 3577-3580 3583 
3591-3593 3606-3607 3629-3630 3652 3729- 
3730 3742-3744 3760 3810 3817-3821 3831- 
3833 3845 3856-3857 3941-3942 3959-3960 
3963-3965 4036-4039 4077-4079 4082-4084 
4098-4099 4108 4114-4116 4125 4132-4133 
4157-4158 4211 4291-4295 4329-4331 4335- 
4337 4343-4347 4376-4377 4391-4393 4403 
4420-4424 4461-4464 4480 4542-4544 4549 
4552 4570-4571 4574 4581-4582 4614-4615 
4622-4623 4633-4635 4660 4672-4674 4691- 
4692 4740-4741 4748-4749 4751-4755 4764- 
4766 4768-4771 4785 4789 4823 4845 4882 
4886 4900 4903 4905-4906 4910 4918-4920 
4928 4946 4952 4967-4970 5000-5003 5024 
5050-5059 5109 5193-5174 5190-5192 5229-
5224 5241-5242 5259 5275 5278-5279 5298- 
5302 5329-5330 5381-5383 5387-5391 5448- 
5450 5458 

adult 
brain 

Clontech 

ABR011 

154-156 388-390 1076 1128 1182-1184 1193 
1202-1204 1422-1424 2072-2074 2144 2251- 
2252 2549-2550 4062-4064 5298-5302 

adult
brain 

rnoL^nam 

/\J3lw 1 Z 

AA(\ AA\ £ft9_fi(Y} 889.807 097.098 1 589- 

'HtU-H'tI OUZ - Ovj OOJ'-OJ'/ 77/-770 1JOZ" 

1584 1719-1721 1780-1783 2072-2074 2186- 
2187 2223 2377-2379 3394-3395 4291-4295 
4581-4582 5123-5124 

adult
brain 

Invitrogen

r\DS\\J ID 

'*41_'344 491.4.9? 19fK-1907 1SR0-1S81 
1599-1602 1857-1859 1925 2072-2074 2186- 
2187 2208-2210 2377-2379 2469 3250-3252 
3304-3307 4267-4270 4796-4797 

adult 
brain 

Invitrogen 

ABT004 

12-13 38 52 70 92-95 126 175-177 255-257 
291 341-344 346-349 354 478 557 583-590 
612-620 675 789-793 796 840-844 871-872 
879-884 909 919-920 964-966 997-998 1017- 
1021 1026-1027 1042-1043 1051 1070-1071 
1076 1088 1108 1151-1153 1160-1164 1193 
1217-1218 1228-1229 1269-1270 1281-1282 
1320-1321 1349-1350 1385 1427-1431 1467-
1469 1485 1532 1575-1576 1626 1629-1632 
1640-1645 1708 1715-1716 1727 1742-1746 
1773-1774 1799-1804 1807-1813 1852 1860 
1865-1875 1900-1911 1948-1949 1954-1962 
1964-1965 1981-1983 1990-1991 2010-2013 
2036-2037 2054 2072-2074 2078-2082 2086- 
2087 2143-2147 2174-2176 2186-2187 2224- 
2228 2231-2232 2255-2257 2264 2284 2310- 
2312 2369-2375 2397-2399 2419-2420 2436 
2526-2527 2592 2604 2624 2626 2629-2631 
2696-2697 2734 2751 2785 2813 2857-2860
2886-2887 2925-2929 2981-2984 3124-3128 
3195 3212-3214 3219-3222 3253-3255 3258- 
3259 3313 3339-3340 3403-3404 3411-3417 
3466-3468 3508-3513 3554-3556 3599-3601 
3661-3662 3693 3835 3959-3960 3997-3998 
4036-4039 4060-4061 4068-4069 4077-4079 
4093 4124 4131 4291-4295 4297-4299 4330- 
4331 4379-4380 4391-4393 4542-4544 4549 
4569 4672-4674 4861 4870 4900 4923-4925 
5000 5021-5023 5222-5224 5282 5387-5388 

cultured 

preadipoc 

ytes 

Stratagene 

ADP001 

74-77 134 154-156 175-177 201-205 213-215 
278-283 313-315 489-492 520-521 652-653 
670-671 680-684 736-740 743-744 784-786 
796 814-816 822-823 857-859 885-886 944 
950 964-966 994 1028 1042-1043 1052 1069- 
1071 1089-1091 1129-1130 1143 1154 1156
1172 1198-1204 1249-1255 1278-1280 1317-
1318 1320-1321 1351 1359 1380 1410 1455-
1456 1473 1507 1532-1535 1547-1548 1553-
1556 1559-1560 1588-1589 1611 1617-1619 
1640-1645 1648 1663 1666 1723-1724 1727 
1746 1755-1756 1765-1769 1773-1774 1780- 
1783 1839-1844 1870-1877 1925 1990-1991 
2060-2061 2118 2193-2195 2197-2199 2223
2234-2242 2298-2299 2310 2331 2380-2381 
2443 2452-2454 2524-2525 2572-2573 2591- 
2592 2594-2595 2604 2672-2674 2709-2711
2734 2739 2819 2843-2847 2861-2862 2899- 
2900 2913 2925-2929 2979 2985 3013-3014 
3159-3162 3181-3183 3189-3191 3220-3222
3253-3255 3285 3310-3311 3462 3486-3487 
3587 3638-3640 3673-3677 3754 3804-3806 
3815-3816 3871-3872 3969-3971 4014-4015 
4036-4039 4068-4069 4140-4141 4241-4242 
4254 4341 4534 4554-4555 4570-4571 4581- 
4582 4622-4623 4740-4741 4864-4865 4910 
5001-5003 5038-5039 5095-5097 5137-5140 

adrenal 
gland 

Clontech 

ADR002 

1-2 12-13 35 52 62 100-106 121-122 140-142 
153-156 191-192 213-215 221 232 301-303 
306 313-315 341-344 366-367 394 459 491- 
492 513 551-553 583-590 592-595 652-653 
670-671 719-722 728-733 743-744 747-750 
755-757 772 784-786 814-816 847 849-851 
889-897 909-914 916-920 944 946-949 961- 
962 993-995 997-999 1049 1070-1071 1078 
1089-1091 1117-1119 1128 1151-1153 1160- 
1164 1175 1182-1184 1193 1220-1221 1269- 
1270 1272-1273 1287 1307 1352 1355 1357- 
1359 1407 1415-1417 1422-1423 1480 1485 
1498-1499 1505 1507-1510 1526 1541-1546
1553-1556 1580-1581 1585 1597 1613-1619 
1638-1639 1649-1652 1656 1683-1684 1701- 
1704 1708 1715-1717 1727-1736 1747-1751 
1753 1770-1771 1799-1806 1835-1838 1870- 
1877 1896 1903-1911 1915-1922 1950-1951 
1992 2000-2002 2010-2013 2025-2026 2038 
2050-2051 2057-2059 2062-2064 2072-2074 
2086-2087 2115 2144 2153-2155 2179-2181 
2186-2187 2196-2199 2208-2210 2214-2215 
2224-2228 2234-2242 2258-2260 2262 2264 
2267-2270 2296-2299 2324 2327-2330 2347- 
2348 2360 2368 2389-2390 2419-2420 2425- 
2426 2465 2499 2511-2513 2523 2560 2573 
2650 2665 2668 2672-2674 2726 2735 2741- 
2743 2800-2801 2813 2824-2826 2840-2841 
2844-2845 2852-2854 2861-2862 2871-2872 
2894-2895 2898 2909-2911 2925-2929 2931- 
2939 2945-2946 2961-2962 2992 3013-3014 
3017-3022 3026-3027 3035-3036 3054 3076- 
3078 3172-3173 3189-3191 3195 3218 3229 
3242 3250-3252 3285 3309 3316-3318 3323- 
3324 3342-3344 3396-3398 3428-3429 3444- 
3445 3472-3473 3477-3478 3519 3561-3562 
3588 3602-3604 3621-3625 3637 3667 3673- 
3677 3689 3691 3693 3699 3724-3725 3784- 
3786 3807 3851 3860-3861 3877 3936 3955- 
3957 3969-3971 4024-4025 4027 4055-4056 
4060-4061 4076 4082-4084 4100 4112-4116 
4123 4138-4139 4219 4228 4230-4231 4243- 
4244 4287-4288 4330-4331 4342 4382 4391- 
4393 4410-4412 4473 4529 4537-4541 4546- 
4548 4581-4582 4629-4632 4796-4798 4835 
4864-4865 4905-4910 5001-5003 5034-5036 
5190-5192 5210 5234 5278-5279 

adult
heart 

GIBCO 

AHR001 

45-46 52 56 100-106 133-134 140-142 154- 
156 173 175-177 192 195-196 201-205 212- 
218 227-230 235 278-283 286-287 301-303 
313-315 323 332-333 341-344 346-352 366- 
367 379-380 395 400-404 413-414 436 469 
478 491-492 511 520-521 531-532 551-553 
557 574-577 583-590 599-601 604 607 612- 
620 652-653 675 677-678 680-685 697 707 
743-744 784-786 789-796 799-800 814-816 
822-823 847 885-886 889-897 915-920 924- 
929 931-936 944-945 950 957-960 964-966 
969 971-979 992 994-1002 1017-1027 1044- 
1050 1052-1054 1056-1057 1063-1067 1070- 
1071 1075 1110-1113 1117-1119 1127-1136 
1139-1143 1154 1156 1159 1172-1173 1182- 
1185 1192-1193 1202-1207 1220-1221 1228
1230-1232 1235-1236 1244-1246 1249-1255 
1264 1271 1281-1282 1287 1297 1300-1301 
1315-1319 1322-1327 1330-1331 1349-1350 
1353-1355 1357-1362 1374 1386-1387 1389 
1397 1403-1407 1414 1422-1423 1438 1440- 
1443 1450-1462 1465-1466 1478-1479 1481- 
1485 1489 1498-1499 1507 1515 1524 1532 
1539 1541-1546 1550-1556 1569-1576 1579
1585 1588-1589 1591 1605-1608 1611 1629- 
1632 1639-1652 1663-1664 1666-1673 1685 
1691-1692 1694-1698 1701-1704 1707 1709 
1715-1716 1719-1721 1723-1724 1728-1739 
1746 1753-1756 1765-1769 1780-1783 1792- 
1798 1805-1817 1831-1834 1839-1851 1853 
1860 1865-1869 1876-1877 1879-1885 1897
1919-1922 1925-1927 1948-1962 1978-1979 
1981-1983 2004-2005 2035-2038 2042-2043 
2045-2047 2050-2057 2060-2064 2072-2074 
2086-2091 2096-2097 2111-2114 2116 2118
2144 2146-2147 2151-2160 2173 2179-2183 
2188-2195 2208-2213 2216-2221 2223 2229- 
2232 2234-2242 2251-2252 2254-2257 2262 
2267-2270 2281-2282 2284 2288-2291 2296-
2297 2303-2304 2310 2318-2320 2327-2331 
2369-2375 2382 2388 2405-2413 2418 2443 
2452-2454 2467 2470-2472 2488 2499 2510-
2513 2534-2538 2558-2559 2574-2575 2579- 
2584 2592 2598 2604 2629-2631 2633-2636
2645-2647 2649 2656-2658 2665 2667 2700 
2747-2748 2752-2753 2760-2761 2774-2776 
2781 2785 2800-2801 2808-2809 2818-2819 
2824-2826 2843-2847 2865 2888 2898 2901- 
2903 2914-2917 2931-2939 2944 2954 2960- 
2962 2965-2973 2990 2992 2996-2998 3013- 
3014 3017-3020 3025-3027 3031-3032 3035- 
3036 3054 3075 3084-3085 3096-3097 3100 
3139-3147 3156-3157 3159-3162 3167-3168 
3170-3173 3181-3184 3189-3191 3195 3204 
3210-3211 3224-3228 3240 3276-3277 3289- 
3290 3302 3310-3311 3323-3324 3336-3338 
3341 3420 3428-3429 3444-3445 3451-3455
3470-3471 3505-3506 3558-3560 3595-3596 
3606-3607 3619-3625 3632-3634 3663 3670- 
3671 3689 3691 3699 3707 3724-3725 3729- 
3730 3735 3742-3744 3747 3795-3800 3804- 
3806 3854 3858-3859 3864-3865 3882-3883 
3959-3960 3962 3969-3971 4014-4015 4030- 
4031 4033-4034 4040-4042 4055-4056 4060- 
4064 4068-4069 4087-4089 4101-4102 4114- 
4116 4123 4126 4137 4140-4141 4157-4158
4220-4223 4228 4230-4235 4291-4295 4320 
4322 4335-4337 4404-4407 4420-4424 4581- 
4586 4613 4627 4633-4634 4652-4653 4672- 
4674 4740-4741 4761-4766 4768-4771 4776 
4796-4797 4835 4862-4865 4886 4900 4907- 
4910 4912 4956-4957 4967-4970 4994-4995 
5000-5003 5021-5023 5085-5088 5095-5097 
5122 5135 5148-5149 5151-5152 5166-5167 
5190-5192 5222-5224 5260 5280 5303-5305 
5329-5330 5387-5391 5399-5401 5421-5422 
5427-5429 5464-5466 5485 5492-5495 

adult 
kidney 

GIBCO 

AKD001 

32-34 36-37 39-40 42 45-47 74-78 87 100- 
106 116-119 136 165 175-177 213-218 220 
223-231 235 244 252-253 258-260 278-283 
298-300 313-320 324-325 332-333 341-344 
346-349 364 366-367 379-380 394 396-398 
419 436 440-441 445-446 452 474 491-492 
498 519 548-553 557 574-577 583-590 602- 
603 607 629-630 652-653 677-678 682-684 
707 709-711 719-722 728-733 736-740 778-
786 789-793 799-800 806-808 814-816 822- 
823 836-838 840-844 852-854 857-859 871- 
875 879-886 889-897 899-905 909-915 919- 
920 924-926 931-936 944-962 964-966 969-
974 980-988 994-995 997-998 1000-1009 
1017-1021 1026-1027 1036-1040 1042-1043 
1049-1050 1052 1063-1071 1075-1076 1078- 
1079 1081-1082 1085 1088-1091 1110-1113 
1116-1121 1127-1130 1137-1142 1151-1155 
1159 1172-1173 1182-1184 1189-1193 1198- 
1207 1217-1218 1220-1221 1230-1232 1235- 
1236 1249-1260 1269-1271 1278-1280 1287 
1294-1297 1300-1301 1307 1315-1321 1328 
1334-1335 1349-1350 1352-1354 1357-1362 
1374 1385-1389 1397-1399 1403-1407 1410 
1414-1420 1422-1423 1425-1426 1435-1436 
1438 1440-1441 1444 1451-1462 1465-1466 
1470-1472 1475-1477 1479 1481-1485 1488- 
1489 1498-1499 1504-1505 1507-1510 1515- 
1517 1524 1527-1532 1536-1538 1540-1548 
1551-1557 1561-1563 1569-1576 1579-1589 
1591 1597 1603-1608 1611-1619 1625-1626
1634-1648 1653-1654 1656 1663-1665 1667- 
1682 1685 1688-1692 1694-1698 1701-1704 
1707-1708 1710-1716 1719-1721 1723-1724 
1727-1739 1743-1746 1753 1755-1758 1763- 
1771 1773-1783 1796-1798 1805-1806 1814- 
1817 1830-1847 1857-1860 1865-1877 1882- 
1885 1903-1911 1913-1922 1925-1927 1948- 
1953 1964-1974 1978-1979 1981-1983 1993- 
1998 2004-2005 2010-2013 2021-2038 2042- 
2043 2045-2047 2055-2061 2068-2083 2086- 
2087 2092 2094 2096-2100 2111-2114 2116
2118 2125-2133 2137-2142 2144 2146-2147 
2151-2152 2156-2160 2173-2181 2186-2188 
2191-2195 2197-2199 2202-2203 2208-2213 
2216-2221 2223-2230 2234-2242 2244-2248 
2251-2252 2254-2257 2261 2267-2270 2272- 
2273 2280-2282 2284 2288-2291 2296-2297 
2302 2310 2318-2320 2331 2333-2334 2338- 
2340 2368 2377-2382 2386 2388-2392 2403 
2405-2415 2422-2424 2427 2430 2440-2441 
2446-2447 2451 2467-2472 2475 2483-2485 
2488 2490-2491 2496 2499 2510-2513 2521- 
2525 2528-2531 2536 2546 2554-2556 2564- 
2572 2574-2575 2579-2584 2591-2592 2596 
2604 2629-2637 2645-2649 2672-2676 2693 
2696-2697 2702-2706 2709-2711 2716-2718
2721 2726 2730 2734 2747-2748 2754-2758 
2760-2761 2763 2768-2772 2774-2778 2781 
2785 2800-2801 2805-2806 2809 2814 2818 
2828 2836-2839 2842-2843 2854-2863 2865 
2873-2874 2888 2894-2898 2901-2903 2913 
2925-2929 2931-2939 2945-2946 2960-2962 
2969-2976 2979 3009 3013-3014 3017-3022 
3026-3027 3054 3076-3078 3082 3098-3100 
3102-3105 3109 3136-3137 3139-3147 3151- 
3162 3167-3168 3170-3174 3189-3191 3195 
3204 3215-3216 3218 3224-3230 3234 3240 
3242 3256-3267 3276-3277 3280-3282 3285 
3288-3290 3292-3293 3296-3299 3313 3323- 
3324 3331-3335 3339-3340 3342-3344 3367- 
3368 3374-3382 3394-3398 3403-3404 3406- 
3407 3409-3410 3428-3429 3438-3441 3443- 
3445 3456 3462 3466-3468 3470-3471 3519 
3535-3543 3554-3556 3561-3562 3576-3580 
3589 3605 3610-3613 3619-3625 3628 3632- 
3634 3638-3640 3664-3665 3667 3670-3671 
3673-3677 3684 3686-3691 3716 3724-3725 
3742-3744 3747 3760-3761 3780-3781 3815- 
3816 3822-3824 3826 3830 3837-3838 3870 
3880 3882-3883 3895 3897-3905 3911-3919 
3939-3951 3955-3957 3959-3960 3966-3971 
3997-3998 4014-4015 4036-4039 4055-4056 
4060-4064 4071-4075 4077-4079 4082-4084 
4093 4098 4101-4102 4114-4116 4119-4123 
4136 4138-4143 4220-4223 4230-4235 4243- 
4244 4252-4253 4255-4257 4260 4267-4270 
4285-4288 4322 4335-4337 4342 4363 4383- 
4384 4391-4393 4400 4430-4432 4439 4451- 


WO 01/79449 


PCT/US01/08656 


4452 4477 4520 4534 4546-4548 4561 4563 
4569-4571 4598-4601 4604 4613 4616-4621 
4629-4634 4636-4637 4652-4653 4668-4671 
4679-4683 4691-4692 4740-4741 4751-4753 
4756-4757 4764-4766 4768-4771 4796-4797 
4815 4817 4824-4826 4832-4834 4845 4862- 
4865 4878-4879 4900 4907-4910 4912 4923- 
4926 4967-4970 5001-5003 5021-5023 5038- 
5039 5045 5050-5052 5074 5078 5082-5083 
5085-5088 5109 5115 5123-5124 5137-5141 
5148-5149 5154-5157 5184-5187 5241-5242 
5261-5267 5284-5289 5298-5302 5308-5309 
5372-5373 5387-5391 5393 5421-5422 5435- 
5437 5455 5464-5466 5485 5489 


adult 
kidney 


Invitrogen 


AKT002 


1-2 70 278-283 313-315 379-380 457 491- 
492 574-577 582 604 652-653 699-701 707 
709-711 719-722 764-771 794-795 814-816 
822-823 840-844 906-909 924 944 950 963 
975-988 993 995 1017-1021 1042-1043 1063- 
1067 1070-1071 1076 1079 1110-1113 1117- 
1119 1128 1137-1143 1172 1182-1184 1193 
1208-1212 1220-1221 1235-1242 1278-1280 
1287 1297 1315-1318 1323-1328 1355 1357- 
1358 1360-1371 1374 1397 1405-1406 1414 
1418-1420 1425 1457-1462 1488 1507 1515 
1536-1538 1547-1548 1551-1552 1559-1560 
1579 1626 1656 1664 1674-1682 1685-1689 
1691-1693 1706 1708 1710-1716 1719-1721 
1728-1734 1737-1739 1753 1773-1774 1845- 
1851 1870-1875 1897 1903-1911 1913-1914 
1925 1948-1949 1951-1953 1978-1979 1981- 
1983 1990-1991 2004-2005 2017-2020 2027- 
2030 2038 2048 2054 2062-2064 2072-2074 
2076-2077 2116 2118 2125-2133 2156-2160 
2174-2176 2179-2181 2186-2188 2208-2210 
2214-2215 2224-2228 2275 2277 2296-2297 
2321 2377-2379 2391 2397-2399 2421 2428 
2452-2454 2473-2474 2492-2494 2499 2528- 
2531 2536 2560 2579-2584 2592 2594-2595 
2608-2616 2706 2734 2781 2785 2818 2843- 
2845 2854 2861-2862 2886-2887 2974-2975 
2979 2984 2996-2998 3008 3100 3139-3147 
3149 3151-3152 3156-3157 3184 3195 3218 
3250-3252 3260-3267 3269 3313 3325-3327 
3336-3338 3341-3344 3424-3427 3550-3552 
3554-3556 3590 3624-3625 3628 3658-3660 
3663 3693 3791 3822-3824 3943-3948 4004 
4040-4042 4055-4056 4076 4093 4109-4111 
4232-4235 4241-4242 4275-4277 4534 4549 
4622-4623 4633-4634 4740-4741 4764-4766 


4796-4797 4811 4864-4865 4907-4909 4912 
4994-4995 5109 5137-5140 5148-5150 5154- 
5157 5194 5222-5224 5259 5261-5267 5272- 
5274 5276-5277 5298-5302 5329-5330 5367- 
5368 5387-5388 5435-5437 5485 


adult lung 


GIBCO 


ALG001 


78 136 138-139 175-177 313-315 324-325 
341-344 413-414 440-441 456 491-492 511 
557 652-653 677-678 728-733 784-786 794- 
795 822-823 849-851 855-856 885-886 919- 
920 954-960 975-988 992-993 997-999 1003- 
1006 1017-1021 1026-1027 1042-1043 1053- 
1054 1075 1088 1129-1130 1143 1182-1184 
1189-1191 1198-1201 1208-1212 1271 1297 
1300-1301 1317-1318 1352-1355 1374 1407 
1422-1423 1455-1462 1481-1484 1488-1489 
1497 1507-1512 1516-1517 1532-1535 1541- 
1548 1551-1556 1582-1584 1588-1589 1591 
1603-1604 1611 1617-1619 1663 1723 1727- 
1734 1742-1746 1753 1780-1783 1814-1817 
1831-1834 1852 1870-1875 1919-1922 1925 
1951 2005-2007 2038 2058-2061 2072-2074 
2086-2087 2116 2118 2121-2136 2144 2153- 
2155 2163-2168 2179-2181 2186-2187 2214- 
2215 2223-2228 2230 2234-2242 2277 2283 
2296-2299 2331 2380-2382 2389-2390 2467- 
2469 2473-2474 2499 2536 2553 2564-2571 
2574-2575 2604 2672-2674 2677-2680 2749- 
2750 2759 2761 2774-2776 2843 2855-2856 
2913 2957 2960 2969-2973 3081 3084-3085 
3098-3099 3156-3157 3167-3168 3213-3214 
3220-3222 3226-3228 3238 3256 3280-3282 
3289-3290 3319-3322 3333-3335 3409-3410 
3442 3466-3468 3558-3560 3588 3621-3625 
3628 3689 3776-3777 3815-3816 3893 3908 
4040-4042 4068-4069 4114-4116 4136 4232- 
4235 4291-4295 4335-4337 4404-4407 4439 
4545 4672-4674 4756-4757 4796-4797 4804- 
4805 4886 4907-4909 5001-5003 5046-5047 
5095-5097 5142-5143 5387-5388 5464-5466 


lymph 
node 


Clontech 


ALN001 


39-40 143-148 154-156 269 278-283 313-315 
445-446 728-733 736-742 764-771 814-816 
822-823 931-936 950 961-962 994 1000-1002 
1017-1021 1129-1130 1139-1142 1151-1153 
1182-1184 1198-1204 1244-1246 1256 1319 
1359 1398-1399 1425 1438 1455-1462 1478 
1504 1507 1511-1512 1532 1539 1547-1549 
1553-1556 1575-1576 1617-1619 1648 1659- 
1660 1663 1719-1721 1735-1736 1753 1755- 
1756 1839-1844 1857-1859 1919-1922 1925 
1951 1993-1998 2004 2038 2042-2043 2048 






2060-2061 2086-2087 2107 2111-2114 2118 
2125-2130 2137 2144-2145 2191-2195 2208- 
2210 2214-2215 2223 2254 2277 2288-2291 
2296-2297 2394 2470-2472 2483 2526-2527 
2554-2556 2649 2774-2776 2852-2854 2861- 
2862 2865 2888 2896-2897 2965-2968 3029 
3133-3135 3189-3191 3242 3250-3252 3280- 
3282 3289-3290 3312 3333-3335 3411-3417 
3577-3580 3638-3640 3716 3817-3821 3878- 
3879 3962 4023 4090-4092 4134 4140-4141 
4219 4285-4286 4581-4582 4796-4797 4864- 
4865 4907-4909 5001-5003 5261-5267 5272- 
5274 5323-5325 5332-5333 5335-5343 5423- 
5425 5444 

young 
liver 

GIBCO 

ALV001 

48-50 78 100-110 210-211 255-257 261-266 
278-283 286-287 313-320 332-333 381-383 
395 419 435-436 491-492 548-553 574-577 
652-653 677-678 709-711 755-757 784-786 
789-793 799-803 806-808 822-823 840-844 
852-854 910-914 916-918 924 944 969 995 
997-998 1056-1057 1063-1068 1085 1089- 
1091 1116 1120-1121 1128-1130 1139-1142 
1151-1155 1172 1177-1179 1182-1184 1189- 
1191 1198-1201 1205-1207 1217-1218 1220- 
1221 1230-1232 1249-1256 1269-1273 1290- 
1291 1300-1301 1310-1314 1323-1328 1357- 
1358 1360-1362 1374 1410 1418-1420 1479- 
1484 1497 1507 1516-1517 1527-1531 1541- 
1546 1551-1552 1557 1579-1581 1585 1590 
1592 1613-1619 1626 1656 1664 1685 1691- 
1692 1694-1698 1701-1702 1708-1709 1723 
1725-1726 1735-1739 1753 1759-1762 1765- 
1771 1773-1774 1780-1790 1796-1798 1827- 
1829 1835-1838 1848-1852 1865-1875 1882- 
1885 1903-1911 1913-1914 1919-1922 1925 
1951 1964-1965 1978-1979 2005 2031-2034 
2060-2061 2075 2086-2091 2096-2097 2118 
2144 2153-2160 2174-2176 2188 2200-2201 
2223-2228 2234-2242 2244-2245 2281-2282 
2288-2291 2321 2358 2380-2382 2414-2415 
2423-2424 2427 2447 2451 2469 2477-2479 
2484-2485 2503-2504 2510 2533 2543-2544 
2560 2564-2571 2579-2584 2587 2648 2761 
2836-2839 2843 2865 2873-2874 2879-2881 
2945-2946 2951-2952 2957 2974-2975 3013- 
3014 3076-3078 3139-3147 3151-3152 3156- 
3157 3181-3183 3195 3226-3228 3242 3250- 
3252 3280-3282 3299 3310-3311 3328-3330 
3345-3346 3403-3404 3456 3462 3561-3562 
3599-3601 3619-3625 3628 3654-3657 3815- 




3816 3828 3969-3971 4023 4062-4064 4090- 
4092 4121-4122 4228 4373-4375 4403 4451- 
4452 4563 4622-4623 4635-4637 4738-4739 
4768-4771 4796-4797 4815 4899 4907-4909 
4915-4916 4998-4999 5001-5003 5045 5075- 
5077 5082-5083 5130-5131 5166-5167 5226 
5335-5343 5399-5403 5414-5415 5452-5453 
5456-5457 5464-5466 

adult 
liver 

Invitrogen 

ALV002 

35-37 62 70 107-110 131-132 175-177 192 
233 255-257 261-266 278-283 313-315 337 
354 365 374-375 445-446 450-451 478 491- 
492 652-653 801-803 840-844 848 852-854 
903-905 944 954-956 995 997-998 1003-1006 
1026-1027 1032-1034 1042-1047 1049 1056- 
1060 1063-1071 1078 1089-1091 1117-1119 
1139-1143 1151-1154 1158-1159 1177-1181 
1188-1191 1193 1205-1207 1217-1218 1230- 
1232 1278-1282 1307 1310-1314 1323-1327 
1337-1345 1351 1360-1371 1380 1451-1454 
1485 1533-1535 1547-1548 1569-1574 1592 
1626 1640-1647 1656 1663 1691-1692 1708- 
1709 1723 1725-1726 1735-1739 1759-1762 
1770-1771 1773-1774 1827-1829 1835-1844 
1913-1914 1919-1922 1925 1948-1949 1954- 
1962 1981-1983 2010-2013 2025-2026 2054 
2060-2061 2118 2171 2174-2176 2186-2190 
2193-2195 2208-2210 2223 2254 2267-2270 
2276-2277 2296-2297 2308 2322 2338-2340 
2380-2381 2499 2533 2536 2543-2544 2560 
2579-2584 2629-2631 2648 2659-2662 2665 
2741-2743 2800-2801 2828 2843 2865 2879- 
2882 2905 2914-2917 2925-2929 2957 2960- 
2962 2974-2975 3013-3014 3054 3089 3156- 
3157 3181-3183 3199 3220-3222 3229 3310- 
3311 3328-3330 3371-3373 3462 3466-3469 
3472-3473 3536-3543 3577-3580 3667 3749- 
3752 3793 3997-3998 4014-4015 4036-4039 
4082-4084 4096-4097 4282 4330-4331 4376- 
4377 4381 4451-4452 4616-4621 4633-4634 
4636-4637 4649 4687-4689 4738-4739 4754- 
4755 4768-4771 4796-4797 5050-5052 5057- 
5065 5082-5083 5130-5131 5145 5148-5149 
5164-5167 5229-5231 5335-5343 5367-5368 
5387-5391 5414-5415 5451-5453 

adult 
liver 

Clontech 

ALV003 

341-344 370-371 849-851 946-949 1177- 
1179 1202-1204 1626 1759-1762 1770-1771 
1913-1914 2484-2485 3328-3330 4403 4998- 
4999 5130-5131 

adult 
ovary 

Invitrogen 

AOV001 

12-13 32-34 39-40 42 44 47-50 52 63-64 70 
74-78 87 100-110 116-119 133 135-139 153 




173 175-177 185 201-205 212-215 220 222 
227-230 233 245 267-268 277-283 286-287 
291 301-303 313-315 321 341-344 357-361 
364 372 376-377 379-380 394 396-398 436 
445-446 459 462 474 478 491-495 509 511 
520-524 538 543 545 551-553 561-562 574- 
577 583-594 604-607 611-620 629-630 641 
652-653 677-678 682-684 697 699-703 707- 
711 719-722 728-733 743-744 747-750 755- 
757 764-771 784-786 789-795 801-803 806- 
808 814-816 822-825 836-837 840-844 855- 
856 863-869 871-875 879-886 889-897 899- 
908 910-914 916-920 924 927 930-936 944 
950-962 964-966 969 971-988 990-995 997- 
1006 1008-1009 1017-1027 1032-1040 1042- 
1047 1049 1052-1054 1068 1070-1071 1075- 
1076 1078-1079 1081-1082 1089-1091 1095 
1108 1117-1121 1128-1142 1151-1156 1158- 
1164 1171-1173 1175 1180-1185 1189-1193 
1198-1207 1217-1218 1220-1221 1228-1232 
1235-1242 1244-1246 1249-1256 1269-1271 
1278-1280 1287 1290-1293 1297-1301 1307 
1315-1328 1332-1335 1348-1359 1363-1371 
1374 1380 1383-1384 1386-1389 1395-1396 
1398-1399 1403-1410 1413-1417 1421-1423 
1426 1432 1435-1436 1438-1444 1446-1449 
1451-1464 1467-1473 1475-1480 1485 1488 
1491-1494 1498-1499 1504-1505 1507-1512 
1515-1517 1520 1527-1538 1541-1548 1550- 
1557 1569-1576 1580-1589 1591 1603-1608 
1611-1612 1617-1619 1621-1623 1625 1629- 
1632 1638-1645 1648-1654 1656-1658 1663- 
1664 1666-1670 1674-1682 1685-1686 1688- 
1692 1694-1698 1701-1702 1707-1709 1717 
1719-1721 1723 1727-1739 1743-1746 1753 
1755-1756 1758 1763-1769 1780-1783 1792- 
1817 1827-1830 1835-1838 1848-1853 1860 
1865-1877 1879-1885 1900-1911 1915-1922 
1925-1936 1948-1953 1964-1965 1978-1979 
1981-1983 1990-1991 1993-1998 2000-2002 
2004-2005 2017-2024 2027-2037 2042-2043 
2045-2048 2052-2061 2066-2067 2076-2077 
2080-2082 2086-2091 2093-2094 2096-2100 
2111-2115 2118 2125-2133 2138-2147 2151- 
2160 2174-2177 2179-2181 2186-2187 2189- 
2195 2197-2201 2204-2215 2223 2229 2231- 
2232 2234-2242 2251-2252 2254-2262 2264- 
2265 2267-2271 2273 2275 2277 2281-2284 
2286-2291 2296-2300 2321 2331 2380-2381 
2386-2392 2395 2397-2399 2403 2414-2415 
2421-2424 2427 2433 2437 2440-2441 2443 
2445 2451-2454 2456 2463 2465 2467-2469 
2473-2474 2483 2488 2492-2494 2499 2502 
2510-2513 2521 2523 2536-2538 2546 2548- 
2550 2552 2558-2562 2574-2575 2587 2591- 
2595 2598 2604 2626 2628-2636 2639-2642 
2645-2648 2650 2656-2662 2664-2665 2668 
2675-2680 2686 2693 2696-2697 2702-2706 
2721 2726 2734-2735 2739 2741-2743 2746- 
2747 2754-2758 2760-2761 2763-2764 2768- 
2771 2774-2778 2781 2783 2785-2789 2800- 
2801 2807 2809 2814 2818-2820 2824-2826 
2828-2829 2831-2832 2842-2843 2854-2862 
2865-2867 2871-2874 2883-2884 2886-2888 
2894-2900 2905 2914-2917 2931-2939 2945- 
2947 2951-2952 2954 2961-2962 2965-2975 
2979 2985 2989-2990 2996-2998 3000 3008- 
3009 3011-3012 3017-3020 3030-3032 3035- 
3036 3054 3068 3070 3084-3085 3089 3098- 
3100 3102-3105 3109-3110 3121 3130 3133- 
3138 3151-3152 3158 3166 3170-3174 3181- 
3183 3192 3195 3197 3199-3203 3210-3216 
3218-3230 3234 3240 3242-3243 3245 3250- 
3259 3269 3276-3277 3280-3282 3285 3289- 
3290 3302 3310-3312 3323-3327 3331-3340 
3342-3344 3394-3395 3403-3404 3409-3417 
3422 3424-3429 3438-3442 3446-3449 3456 
3462 3466-3468 3470-3473 3477-3478 3480 
3491-3495 3516-3517 3519 3522 3535-3543 
3546 3550-3553 3558-3562 3577-3580 3586 
3588 3591-3593 3621-3623 3628-3630 3632- 
3634 3658-3662 3666-3667 3673-3677 3680 
3689 3691 3701 3714 3723-3725 3735 3737- 
3738 3761 3763 3770-3772 3784-3788 3795- 
3800 3803-3807 3810 3815-3816 3822-3824 
3828 3835 3837-3838 3870-3872 3878-3880 
3882-3883 3897-3905 3911-3919 3925-3926 
3931 3941-3942 3955-3957 3959-3960 3963- 
3971 4001 4014-4018 4021 4023 4036-4045 
4055-4056 4060-4061 4068-4069 4071-4079 
4082-4084 4087-4089 4093 4096-4098 4100- 
4102 4104-4105 4109-4116 4121-4123 4126- 
4131 4136-4143 4145-4153 4215-4217 4220- 
4223 4228 4230-4235 4241-4244 4252-4253 
4255-4257 4261-4266 4272-4277 4285-4286 
4291-4295 4326 4335-4337 4341 4374-4375 
4379-4381 4383-4384 4391-4393 4425 4430- 
4432 4438-4439 4461-4464 4481-4489 4501 
4533 4542-4544 4549-4550 4598-4601 4611 
4613 4616-4623 4642 4652-4653 4663 4668- 






4674 4681-4683 4691-4692 4740-4741 4748- 
4749 4751-4753 4764-4766 4777-4780 4784- 
4786 4796-4797 4804-4805 4812 4820 4854 
4861-4865 4874-4875 4886 4902 4907-4910 
4912 4918-4920 4923-4925 4953 4956-4957 
4976-4979 5001-5003 5025 5040 5050-5052 
5057-5065 5085-5088 5095-5097 5109 5115 
5123-5127 5148-5149 5154-5157 5173 5222- 
5224 5241-5242 5251 5259 5261-5267 5276- 
5277 5281 5284-5289 5308-5309 5329-5330 
5335-5343 5367-5368 5387-5391 5399-5401 
5408-5410 5421-5425 5427-5429 5431-5433 
5439 5455 5464-5466 5485 5490-5491 

adult 
placenta 

Clontech 

APL001 

313-315 491-492 605-606 719-722 736-740 
834-835 950 971-979 1017-1022 1151-1153 
1182-1184 1215-1216 1410 1418-1420 1532 
1539-1540 1564-1566 1639 1719-1721 1747- 
1751 1870-1875 1925 1966-1974 1984 2101- 
2102 2230 2424 2510 2524-2525 2574-2575 
2645-2647 2668 2818 2873-2874 3323-3324 
3462 3483 4040-4042 4101-4102 4581-4582 
4793-4795 5188-5189 5376 

placenta 

Invitrogen 

APL002 

12-13 192 364 491-492 520-521 709-711 
755-757 789-793 840-844 885-886 975-979 
1026-1027 1042-1043 1050 1070-1071 1076 
1117-1119 1160-1164 1202-1207 1215-1216 
1272-1273 1320-1321 1351 1360-1362 1380 
1400-1401 1442-1443 1473 1553-1556 1564- 
1566 1603-1608 1621-1623 1694-1698 1724 
1737-1739 1743-1745 1747-1751 1780-1783 
1860 1948-1949 2062-2064 2072-2074 2101- 
2102 2111-2114 2146-2147 2186-2187 2204- 
2207 2310 2434-2435 2470-2472 2488 2511- 
2513 2594-2595 2645-2647 2677-2680 2696- 
2697 2737-2738 2831-2832 2836-2839 2899- 
2900 2925-2929 2954 3065-3067 3195 3199 
3220-3222 3288 3313 3336-3338 3391 3403- 
3404 3466-3468 3536-3543 3624-3625 3638- 
3640 3932-3933 4082-4084 4273-4274 4291- 
4295 4410-4412 4701-4709 4740-4741 5431- 
5433 5435-5437 

adult 
spleen 

GIBCO 

ASP001 

78 137 154-156 175-177 213-215 274 278- 
283 313-315 324-325 332-333 341-344 346- 
349 420 456 479-480 491-492 511 530 557 
612-620 652-653 659-660 677-678 699-701 
728-733 741-744 764-771 784-786 814-816 
822-823 849-851 924 944 950 964-966 975- 
988 994-995 997-998 1000-1002 1008-1009 
1049 1052 1070-1071 1078 1088-1091 1128- 
1136 1151-1153 1171-1172 1193 1195-1196 






1202-1204 1217-1218 1220-1221 1256 1269- 
1271 1287 1294-1297 1315-1318 1332-1333 
1349-1350 1352-1354 1359 1363-1371 1374 
1383-1384 1386-1387 1397 1408-1409 1414 
1418-1420 1422-1423 1425 1440-1441 1446- 
1449 1486 1497 1507-1512 1514 1516-1517 
1527-1532 1540-1548 1551-1552 1575-1576 
1586-1589 1597 1603-1604 1612 1617-1619 
1621-1623 1629-1632 1634-1637 1640-1645 
1654 1656 1663 1686 1691-1692 1708 1710- 
1714 1719-1721 1723-1724 1727 1737-1739 
1746 1753 1765-1769 1773-1774 1780-1783 
1796-1798 1807-1817 1827-1834 1853 1857- 
1859 1870-1885 1903-1911 1913-1914 1919- 
1922 1948-1949 1951 1964-1965 1978-1979 
2025-2026 2035 2038 2040-2043 2045-2047 
2054 2060-2061 2072-2074 2076-2079 2086- 
2087 2111-2114 2116 2118 2131-2133 2137 
2144 2148-2150 2153-2155 2178 2182-2183 
2214-2215 2223 2230 2234-2242 2281-2283 
2298-2299 2303-2304 2310 2331 2380-2382 
2405-2413 2421 2440-2441 2452-2454 2456 
2461 2469-2472 2488 2510-2513 2551 2560 
2573 2603-2604 2608-2616 2650 2696-2697 
2719-2720 2726 2747 2754-2758 2803 2818 
2831-2832 2843-2845 2854 2861-2862 2873- 
2874 2914-2917 2945-2946 2974-2976 3153 
3158 3167-3168 3170-3171 3195 3210-3211 
3215-3216 3226-3228 3250-3252 3258-3259 
3280-3282 3289-3290 3336-3338 3385 3403- 
3404 3428-3429 3466-3468 3536-3543 3561- 
3562 3591-3593 3621-3625 3629-3630 3632- 
3634 3716 3784-3786 3792 3815-3816 3878- 
3879 3886 3935 3966-3971 4014-4015 4023 
4036-4039 4060-4061 4077-4079 4090-4092 
4098 4100 4126 4142-4143 4228 4232-4235 
4239-4240 4335-4337 4374-4375 4400 4404- 
4407 4451-4452 4554-4555 4598-4601 4622- 
4623 4662 4668-4671 4740-4741 4796-4797 
4832-4834 4864-4865 4907-4909 4912 4956- 
4957 5001-5003 5034-5036 5074 5095-5097 
5123-5124 5148-5149 5154-5157 5241-5242 
5261-5267 5272-5274 5298-5302 5310-5311 
5329-5330 5335-5343 5427-5429 5440-5441 
5485 

testis 

GIBCO 

ATS001 

47 81-82 123 136 154-156 175-177 179 227- 
230 278-283 313-315 341-344 366-367 379- 
380 456 491-492 574-577 604 652-653 677- 
678 682-684 699-701 743-744 764-771 784- 
786 811-816 822-823 826-828 879-881 885- 






886 906-908 931-936 944-945 950 957-960 
969 971-974 993-995 997-1002 1008-1009 
1026-1027 1032-1034 1036-1037 1042-1043 
1075-1076 1080 1108 1137-1138 1173 1189- 
1191 1198-1204 1230-1232 1235-1236 1271 
1278-1282 1297 1310-1314 1317-1319 1334- 
1335 1349-1350 1357-1358 1374 1397 1403- 
1404 1413 1418-1420 1422-1423 1435-1436 
1451-1462 1485 1507 1516-1517 1547-1548 
1551-1552 1561-1563 1611 1629-1632 1640- 
1645 1648 1663 1667-1670 1685 1694-1698 
1703-1704 1708 1715-1716 1719-1721 1724 
1746 1753 1755-1756 1758 1770-1771 1780- 
1783 1814-1817 1827-1829 1848-1851 1853 
1865-1869 1882-1885 1898 1925-1927 1948- 
1949 1951 1966-1974 1981-1983 2021-2024 
2027-2030 2038 2042-2043 2052-2056 2060- 
2064 2072-2074 2080-2082 2086-2087 2096- 
2100 2118 2144 2146-2147 2153-2155 2177 
2186-2187 2216-2221 2231-2232 2234-2242 
2254 2267-2270 2275 2283 2310 2331 2380- 
2382 2387 2424 2447 2452-2454 2456 2468 
2473-2474 2499 2510 2536 2548 2573 2592 
2604 2644 2657-2658 2706 2715-2718 2747 
2754-2758 2761 2763-2764 2768-2771 2774- 
2776 2783 2824-2826 2843 2865-2867 2894- 
2895 2898 2945-2946 2961-2962 2989-2990 
3008 3013-3014 3017-3020 3029 3139-3147 
3167-3168 3195 3204 3212-3214 3217-3218 
3226-3228 3242-3243 3256 3285 3289-3290 
3304-3307 3339-3340 3442 3558-3562 3576 
3588 3595-3596 3628 3689 3691 3707 3723 
3735 3795-3800 3810 3871-3872 4014-4015 
4040-4042 4060-4061 4071-4075 4114-4116 
4121-4123 4126 4136 4142-4143 4230-4231 
4241-4242 4252-4253 4335-4337 4379-4380 
4449 4465 4542-4544 4549 4581-4586 4598- 
4601 4740-4741 4796-4797 4832-4834 4864- 
4865 4907-4910 5021-5023 5038-5039 5046- 
5047 5107-5108 5284-5289 5372-5373 5387- 
5391 5399-5401 

Genomic 
DNA from 
BAC 
63118 

Research Genetics 
(CITB BAC Library) 

BAC001 

3895 

Genomic 
DNA from 
BAC 
39316 

Research Genetics 
(CITB BAC Library) 

BAC002 

2639-2642 



adult 
bladder 

Invitrogen 

BLD001 

154-156 175-177 301-303 341-344 652-653 
659-660 950 980-988 997-998 1042-1043 
1069 1075 1139-1142 1160-1164 1193 1244- 
1246 1307 1508-1510 1575-1576 1717 1728- 
1734 1746 1805-1806 1870-1875 1882-1885 
1903-1911 1981-1983 2004 2006-2007 2038 
2060-2061 2072-2074 2118 2191-2192 2273 
2283 2294-2295 2344 2639-2642 2721 2747 
2818-2819 2914-2917 3112 3212 3280-3282 
3424-3427 3470-3471 3536-3543 3664-3665 
3691 3760 3791 3795-3800 4014-4015 4082- 
4084 4335-4337 4613 4796-4797 4864-4865 
4960 5001-5003 5241-5242 5387-5388 5431- 
5433 

bone 
marrow 

Clontech 

BMD001 

30-31 42 48-50 74-78 114-115 120-123 137 
143-165 175-177 213-215 227-230 232 235 
278-290 297-303 305-309 313-315 324-325 
335 341-344 354 379-380 394-398 435-438 
440-441 447-455 462-471 491-492 513 516 
520-521 538 551-553 557 561-562 641 652- 
653 661-671 674 677-678 680-684 699-701 
709-760 763-772 794-795 822-823 849-851 
857-859 863-869 882-886 889-897 909-914 
916-918 921 924-926 931-936 944-945 950- 
956 969 980-988 992-995 997-1021 1026- 
1027 1032-1034 1038-1040 1049 1053-1055 
1070-1071 1075 1079 1108 1110-1113 1128- 
1136 1139-1143 1151-1154 1173 1182-1184 
1186-1187 1193 1198-1204 1217-1218 1220- 
1221 1228 1230-1232 1249-1256 1264 1269- 
1271 1274 1281-1282 1290-1291 1294-1297 
1317-1319 1322-1345 1348-1362 1374-1379 
1386-1387 1397-1399 1405-1407 1414-1417 
1422-1423 1425 1437-1438 1440-1441 1444 
1451-1464 1470 1479 1485-1489 1497-1500 
1504-1505 1507-1512 1514-1515 1518-1520 
1522-1526 1532-1563 1567-1576 1582-1585 
1588-1589 1603-1608 1612 1621-1623 1625 
1629-1632 1634-1637 1646-1648 1655-1656 
1659-1660 1663-1664 1666-1670 1685-1690 
1694-1698 1701-1702 1707-1708 1710-1716 
1719-1721 1723-1724 1728-1739 1746 1752- 
1753 1755-1756 1765-1771 1773-1779 1805- 
1813 1830-1838 1853 1857-1860 1870-1875 
1879-1881 1894-1896 1913-1922 1925-1936 
1948-1951 1963 1966-1974 1978-1979 1993- 
1998 2000-2003 2005 2017-2020 2027-2030 
2036-2056 2060-2064 2066-2067 2080-2082 
2086-2087 2095 2098-2102 2107-2108 2111- 
2118 2121-2150 2153-2168 2172 2174-2177 






2191-2195 2202-2203 2214-2221 2223 2229 
2231-2242 2246-2248 2254 2262 2264 2273 
2283 2288-2291 2294-2299 2302 2311-2312 
2327-2330 2358 2377-2379 2387-2403 2418 
2422-2424 2427 2440-2441 2443 2448-2465 
2467-2469 2473-2474 2480 2488 2495 2510- 
2513 2519-2520 2528-2531 2560 2572 2592 
2598 2604 2628 2644-2648 2650 2656 2677- 
2680 2686 2698-2699 2715 2719-2720 2722- 
2744 2749-2750 2754-2758 2760-2761 2768- 
2771 2774-2776 2781 2783 2785 2793-2820 
2824-2826 2829 2843 2846-2847 2863-2867 
2873-2874 2888 2891 2894-2895 2904-2905 
2931-2939 2945-2946 2965-2973 2976 3008 
3011-3012 3017-3022 3029 3041-3049 3054 
3100 3102-3105 3150 3166-3175 3181-3186 
3188-3194 3204 3208-3209 3212 3220-3222 
3226-3230 3235-3243 3245-3252 3256-3273 
3276-3277 3280-3283 3285 3289-3290 3299 
3304-3307 3319-3322 3341-3346 3372-3373 
3402 3406-3407 3422 3424-3427 3438-3441 
3446-3449 3456 3466-3468 3470-3471 3486- 
3487 3491-3495 3505-3506 3508-3513 3536- 
3543 3550-3552 3557-3562 3566-3573 3576 
3598-3607 3609-3614 3616-3628 3663-3665 
3673-3677 3682 3707 3724-3725 3729-3730 
3742-3744 3754 3761 3792 3794-3809 3817- 
3821 3826 3828 3836-3861 3867-3869 3878- 
3879 3881-3884 3897-3905 3911-3919 3955- 
3957 3969-3971 4023 4028-4029 4052 4055- 
4056 4082-4084 4094-4095 4101-4107 4109- 
4120 4136 4142-4153 4156-4159 4167-4178 
4208-4211 4215-4223 4227-4247 4267-4270 
4275-4277 4285-4286 4291-4296 4383-4384 
4430-4432 4494-4496 4501-4503 4517-4529 
4531-4536 4554-4555 4572-4591 4596-4601 
4624-4626 4649 4651 4662 4664-4665 4691- 
4692 4729 4738-4741 4761-4780 4793-4810 
4832-4834 4862-4865 4884 4907-4910 4923- 
4928 4930-4931 4933-4935 4937-4943 4945 
4961-4985 5001-5003 5038-5039 5050-5052 
5080 5114-5115 5137-5141 5148-5149 5153- 
5157 5180 5190-5192 5241-5242 5250 5252 
5254-5277 5303-5305 5307-5325 5327-5343 
5345-5354 5367-5374 5376-5379 5381-5385 
5387-5388 5397-5398 5444 5460-5461 5464- 
5466 5485 

bone 
marrow 

Clontech 

BMD002 

175-177 249-250 254 258-260 301-303 313- 
315 324-325 413-414 440-441 491-492 540 
574-577 580-581 592-594 599-601 612-620 




652-653 724-733 741-750 804-805 814-816 
846 849-851 889-897 903-905 910-914 957- 
960 970 992 994 997-998 1010-1014 1023- 
1027 1038-1040 1089-1091 1095 1110-1113 
1128 1202-1207 1217-1218 1235-1236 1256 
1271 1297 1319 1330-1333 1348-1350 1352- 
1354 1357-1358 1383-1384 1397 1457-1462 
1479 1491-1494 1497 1504 1507-1512 1532 
1547-1548 1551-1552 1575-1576 1621-1623 
1646-1647 1686 1719-1721 1727 1743-1745 
1753-1754 1763-1764 1773-1779 1796-1798 
1805-1806 1814-1817 1827-1830 1839-1844 
1848-1851 1913-1918 1925 1993-1998 2040- 
2043 2048 2052-2054 2060-2061 2078-2079 
2088-2091 2116-2118 2131-2142 2148-2150 
2172 2174-2176 2191-2195 2223 2246-2248 
2318-2320 2537-2538 2553 2604 2638 2702- 
2705 2709-2711 2713-2714 2739 2781 2796- 
2798 2803 2931-2939 2961-2962 3026-3027 
3055 3130 3159-3162 3181-3183 3246 3250- 
3252 3304-3307 3402 3536-3543 3793 3847- 
3849 3925-3926 4024-4025 4060-4061 4209- 
4210 4228 4252-4253 4267-4270 4574 4581- 
4582 4729 4787 4796-4797 4858 4907-4909 
4974 4991-4993 5021-5023 5050-5052 5056 
5148-5149 5260-5267 5272-5275 5278-5279 
5335-5343 5377-5378 5416 5423-5425 5485 

bone 
marrow 

Clontech 

BMD004 

728-733 849-851 1349-1350 1486 1860 2050- 
2051 2134-2136 2148-2150 2234-2242 2803 
4209-4210 4598-4601 4652-4653 4907-4909 
5261-5267 5272-5274 

bone 
marrow 

Clontech 

BMD007 

396-398 440-441 453-455 491-492 712-718 
764-771 814-816 846 849-851 1096-1104 
1146-1147 1315-1316 1486 1497 1522-1523 
2134-2136 2148-2150 2223 2803 3250-3252 
4598-4601 5001-5003 5050-5052 5310-5311 

adult 
colon 

Invitrogen 

CLN001 

1-2 32-34 64 175-177 251 278-283 452 478 
814-816 832 870 889-897 944 957-960 1044- 
1047 1069 1117-1119 1128 1139-1142 1195- 
1196 1217-1218 1317-1318 1386-1387 1511- 
1512 1547-1548 1640-1645 1709 1770-1771 
1860 1870-1875 1882-1885 1948-1949 1952- 
1953 1981-1983 2088-2091 2146-2147 2156- 
2160 2174-2176 2208-2210 2254 2347-2348 
2511-2513 2604 2629-2631 2737-2738 2831- 
2832 2852-2853 2865 3035-3036 3156-3157 
3220-3222 3246 3339-3340 3554-3556 3632- 
3634 3663 3673-3677 3693 3780-3781 3870 
3949-3951 4272 4275-4277 4330-4331 4534 
4636-4637 4651 4668-4674 4776 4796-4797 



Mixture 
of 16 
tissues - 
mRNAs* 

Various 
Vendors* 

CTL016 

210-211 910-914 995 1128 1479 1617-1619 
1626 1784-1790 1913-1914 2901-2903 2979 
3831-3833 4796-4797 5001-5003 5075-5077 
5154-5157 5414-5415 

Mixture 
of 16 
tissues - 
mRNAs* 

Various 
Vendors* 

CTL021 

175-177 237-240 652-653 801-803 849-851 
950 993 1042-1043 1063-1067 1156 1310- 
1314 1332-1333 1485 1511-1512 1533-1535 
1746 2148-2150 2182-2183 2186-2187 2223- 
2228 2233 2253 2484-2485 2843 2979 3189- 
3191 3250-3252 4796-4797 4907-4909 5001- 
5003 5050-5052 5196 5226 

adult 
cervix 

BioChain 

CVX001 

1-2 32-34 52 56 70 107-110 123 125 133-134 
137 140-142 153-156 175-177 195-196 212 
227-230 233 278-283 288-290 301-303 313- 
315 324-325 335 341-344 365 379-380 394 
396-398 491-492 514 520-521 539 583-590 
597-598 611 682-684 697 699-701 708 719- 
722 810 814-816 822-823 840-844 857-859 
863-870 873-875 879-881 885-886 889-897 
899 903-905 909 915 919-920 925-926 931- 
936 950-953 957-962 975-988 992-995 997- 
998 1000-1002 1022 1032-1034 1044-1047 
1049 1052 1069 1075 1110-1113 1129-1130 
1144-1145 1154-1155 1165-1170 1172-1173 
1182-1184 1198-1204 1215-1216 1220-1221 
1256 1263 1271 1287 1297 1300-1301 1319- 
1321 1323-1328 1352-1355 1360-1371 1374 
1397 1400-1401 1410 1413 1421 1440-1444 
1455-1464 1470 1475-1477 1479-1480 1487 
1491-1494 1504 1507-1510 1515-1517 1524 
1547-1548 1551-1552 1557 1569-1574 1599- 
1608 1611 1620 1625 1639 1648 1653-1654 
1657-1658 1663 1683-1685 1690 1715-1716 
1723 1735-1736 1753-1756 1763-1764 1780- 
1783 1792-1795 1805-1806 1827-1829 1835- 
1844 1852 1870-1877 1879-1881 1896 1925- 
1927 1951 1964-1965 1993-1998 2000-2002 
2005 2021-2024 2031-2035 2038 2042-2043 
2048 2050-2056 2058-2059 2062-2064 2066- 
2067 2072-2074 2078-2079 2086-2087 2096- 
2100 2111-2114 2116 2118 2137 2143-2144 
2146-2147 2156-2160 2177-2181 2191-2192 
2216-2221 2223-2228 2234-2242 2249 2251- 
2252 2254-2257 2273 2275 2277 2280-2282 
2296-2299 2302 2327-2331 2333-2334 2341 
2344 2349-2356 2358 2368 2377-2381 2389- 
2390 2423-2424 2456 2467 2483 2490-2494 
2499 2510-2513 2546 2549-2550 2560 2563 
2573-2575 2591 2594-2595 2597 2603-2604 
2628-2631 2645-2647 2651-2655 2706 2713- 




2715 2726 2752-2758 2760 2763 2768-2771 
2774-2776 2783 2807 2809 2820 2824-2826 
2829 2831-2832 2836-2839 2843-2845 2854 
2865-2867 2871-2872 2898 2905 2925-2929 
2931-2939 2945-2947 2954 2961-2962 2969- 
2976 2979 2981-2983 2985 2989 3009 3017- 
3020 3030 3080 3089 3093 3096-3097 3100 
3109 3112 3130 3149 3156-3158 3170-3173 
3181-3183 3195 3199 3206 3213-3214 3220- 
3222 3224-3225 3247-3248 3253-3256 3258- 
3267 3280-3282 3285 3289-3290 3292-3293 
3310-3311 3319-3324 3341 3345-3346 3349- 
3352 3394-3398 3403-3405 3409-3410 3420 
3424-3427 3437-3441 3446-3449 3456 3462 
3491-3495 3535-3543 3558-3560 3628-3630 
3654-3657 3673-3677 3680 3691 3707 3714 
3735-3736 3760 3772 3778 3795-3800 3807 
3822-3824 3830 3870-3872 3882-3883 3891 
3925-3926 3959-3960 3969-3971 4076 4098 
4100-4102 4112-4116 4127-4130 4219-4223 
4230-4231 4241-4242 4245 4289-4295 4322 
4382 4391-4393 4403 4435-4437 4550 4581- 
4582 4616-4621 4629-4632 4663 4675 4679 
4681-4683 4761-4766 4785 4796-4797 4884 
4910 4913 4953 4956-4957 4976-4979 5151- 
5152 5177-5179 5272-5274 5284-5289 5293 
5303-5305 5335-5343 5421-5425 5431-5433 
5464-5466 

diaphragm 

BioChain 

DIA002 

574-577 1230-1232 1524 1605-1608 2116 
2143 2843 3795-3800 4060-4061 4598-4601 

endothelial 
cells 

Stratagene 

EDT001 

1-2 32-34 38 45-46 56 70 74-77 137 140-142 
165 173 175-177 187-190 195-196 213-215 
220 231 278-283 294-295 313-315 330 332- 
333 341-344 346-349 364 366-367 379-380 
395 445-446 474 491-495 511 520-521 531- 
532 545 548-553 574-577 612-620 652-653 
682-684 697 704-706 709-711 719-722 801- 
803 811-816 822-823 836-837 863-872 879- 
881 885-886 889-897 899 903-914 919-920 
927 930-936 944 950 954-962 964-966 969 
971-988 993-995 997-998 1000-1002 1007- 
1014 1017-1021 1026-1027 1032-1034 1036- 
1040 1042-1052 1068-1071 1075-1076 1079 
1089-1091 1095 1110-1113 1117-1119 1128- 
1136 1139-1143 1151-1153 1155-1156 1160- 
1164 1172 1192-1193 1198-1204 1217-1218 
1220-1221 1235-1242 1244-1246 1249-1255 
1281-1282 1287 1294-1297 1300-1301 1315- 
1319 1328 1337-1345 1349-1355 1357-1359 
1374 1380 1386-1387 1390-1393 1397-1401 






1403-1406 1408-1409 1418-1420 1422-1423 
1432 1438 1440-1443 1446-1449 1451-1456 
1467-1472 1475-1477 1479-1480 1485 1488- 
1489 1504 1507 1516-1517 1524 1527-1535 
1541-1546 1551-1557 1569-1574 1579 1582- 
1587 1591 1605-1608 1611-1612 1617-1620 
1625 1629-1632 1634-1648 1656 1663 1667- 
1670 1685 1687-1689 1691-1698 1701-1704 
1706 1708-1714 1717 1719-1721 1723-1724 
1727 1737-1739 1743-1746 1753 1755-1756 
1758 1763-1771 1773-1774 1780-1783 1796- 
1804 1807-1817 1830-1844 1857-1860 1870- 
1875 1878 1882-1885 1900-1911 1915-1922 
1925-1927 1948-1962 1964-1974 1978-1979 
1981-1983 1993-1998 2004-2005 2010-2013 
2017-2020 2025-2026 2031-2038 2042-2043 
2045-2048 2052-2054 2058-2061 2068-2071 
2080-2082 2086-2091 2093 2096-2097 2101- 
2102 2111-2114 2118 2121-2124 2138-2142
2144-2147 2153-2160 2171 2173-2176 2178- 
2181 2186-2187 2189-2192 2197-2199 2204- 
2213 2216-2221 2223-2228 2230-2232 2234-
2242 2251-2252 2254-2257 2267-2271 2278
2280-2283 2288-2291 2341 2365 2369-2375
2377-2381 2386 2389-2390 2414-2415 2423 
2452-2454 2456 2469-2474 2492-2494 2496 
2499-2502 2510-2513 2526-2531 2536-2538 
2548 2560-2575 2587 2592 2594-2595 2603- 
2604 2626 2628-2631 2639-2642 2645-2647 
2657-2662 2665 2672-2674 2696-2697 2702- 
2706 2721 2726 2728-2730 2737-2738 2741- 
2743 2748 2754-2758 2760-2761 2774-2778 
2781 2785 2800-2801 2807 2813-2814 2818- 
2819 2821-2826 2829 2831-2832 2836-2841 
2843 2846-2850 2854-2862 2865 2871-2872 
2886-2887 2896-2898 2914-2918 2925-2929 
2931-2939 2945-2946 2960 2969-2975 2977 
2981-2983 2990 3008 3017-3020 3026-3027 
3098-3099 3102-3105 3136-3137 3149 3156- 
3157 3159-3162 3167-3168 3170-3174 3189- 
3191 3195 3213-3214 3218 3229 3231-3233 
3240 3242 3247-3248 3257 3260-3267 3269 
3280-3282 3285 3289-3290 3297-3298 3302 
3312 3319-3324 3331-3335 3349-3352 3396- 
3398 3409-3410 3424-3429 3438-3441 3444- 
3445 3466-3468 3470-3473 3480 3491-3495 
3498-3501 3505-3506 3522 3536-3543 3554- 
3556 3558-3562 3576-3580 3586 3588-3590 
3605 3616-3617 3619-3625 3628 3632-3634 
3638-3640 3658-3660 3664-3665 3667 3670- 
3671 3673-3677 3680 3684 3689 3691 3693 
3707 3717-3723 3754 3780-3781 3787-3788 
3792-3793 3804-3807 3811-3816 3860-3861
3871-3872 3882-3883 3941-3942 3959-3960 
3969-3971 3997-3998 4014-4015 4040-4042 
4062-4064 4068-4069 4071-4075 4077-4079 
4082-4084 4090-4092 4100 4114-4116 4121- 
4122 4140-4144 4215-4217 4220-4223 4239- 
4244 4255-4257 4267-4270 4273-4274 4291- 
4295 4341 4374-4377 4379-4380 4383-4384 
4391-4393 4397 4430-4432 4549 4570-4571 
4583-4586 4598-4601 4604 4633-4634 4636- 
4637 4651 4672-4674 4748-4749 4764-4766 
4796-4797 4804-4805 4812 4864-4865 4907- 
4910 4953 5038-5039 5046-5047 5050-5052 
5095-5097 5107-5108 5119 5123-5124 5137- 
5140 5142-5143 5148-5150 5177-5179 5184- 
5187 5235 5255 5276-5277 5367-5368 5387- 
5391 5399-5401 5408-5410 5423-5425 5427- 
5429 5442 5497 

Genomic 
clones 
from the 
short arm 
of 

chromosome 8

Genomic 

DNA 

from 

Genetic 

Research 

EPM001 

2639-2642 

esophagus

BioChain 

ESO002 

885-886 1639 2223 

fetal 
brain 

Clontech 

FBR001 

153 278-283 863-869 1156 1400-1401 1626 
1691-1692 1727 2118 2229 2604 2645-2647
2844-2845 3174 3763 3780-3781 4090-4092 
4140-4141 4545 4835

fetal 
brain 

Clontech 

FBR004 

855-856 1017-1021 1470 1580-1581 1839- 
1844 1978-1979 2052-2053 2084 2171 2249 
3197 3451-3455 3713 4960 

fetal 
brain 

Clontech 

FBR006 

30-31 39-40 74-77 116-119 130 137 143-148 
175-177 187-190 195-196 216-218 223-226 
366-367 388-390 400-404 465 491-492 520- 
521 557 602-603 607 647-649 652-653 670- 
671 676 680-681 685 698 724-727 743-744 
760 763 789-793 814-817 824-825 829-831 
836-837 849-851 855-856 885-886 889-897 
944 994 997-998 1000-1002 1017-1021 1026- 
1027 1042-1043 1068-1069 1076 1089-1091 
1095 1139-1142 1151-1153 1156 1176 1182- 
1185 1192 1220-1221 1228 1230-1232 1332- 
1333 1349-1350 1357-1358 1389 1394 1400- 
1401 1403-1404 1408-1409 1413 1455-1456 
1507-1510 1520 1605-1608 1617-1619 1629- 


1632 1634-1637 1640-1645 1667-1670 1691- 
1692 1694-1698 1707 1715-1716 1727 1735- 
1736 1743-1745 1754 1763-1769 1773-1779 
1792-1804 1814-1817 1870-1875 1900-1902 
1915-1918 1926-1927 1952-1962 2010-2013 
2021-2026 2045-2048 2057 2072-2074 2084 
2088-2091 2095 2118 2131-2133 2188 2191- 
2192 2197-2199 2223 2229-2230 2234-2242 
2255-2257 2288-2291 2386 2419-2420 2432 
2436-2437 2440-2441 2502 2511-2513 2579-
2584 2597 2604 2650 2657-2658 2669-2671 
2688-2689 2709-2711 2728-2729 2740 2749-
2750 2752-2753 2777-2778 2785 2807 2829 
2846-2850 2857-2860 2888 2925-2929 2969- 
2975 3013-3014 3081 3089 3172-3173 3213- 
3214 3226-3228 3289-3290 3342-3344 3394- 
3395 3402 3406-3407 3438-3441 3466-3468 
3498-3501 3536-3543 3583 3629-3630 3658-
3663 3729-3730 3742-3744 3780-3781 3867- 
3869 3882-3883 4100 4125 4132-4133 4291- 
4295 4297-4299 4376-4377 4461-4464 4480 
4534 4546-4549 4581-4586 4614-4615 4629- 
4632 4691-4692 4729 4905-4906 4946 4967- 
4970 5001-5003 5026 5057-5065 5137-5140 
5194 5222-5224 5247 5260 5298-5302 5329- 
5330 5381-5382 5389-5391 5396 

fetal 
brain 

Clontech 

FBRs03 

1870-1875 1878 3424-3427 3554-3556 4907-
4909 5137-5140 

fetal 
brain 

Invitrogen 

FBT002 

32-34 59-60 92-96 124 128 137 180-182 192 
195-196 278-283 341-344 436 491-492 520- 
521 583-590 607 647-649 652-653 677-678 
778-783 789-793 822-825 849-854 882-884 
950 957-960 964-966 971-974 980-988 1026- 
1028 1038-1040 1042-1043 1050 1070-1071 
1076 1127 1156 1205-1207 1230-1232 1271 
1281-1282 1322 1337-1345 1349-1350 1360- 
1371 1386-1387 1400-1401 1414-1417 1427- 
1431 1435-1436 1470 1507-1510 1532 1603- 
1604 1617-1620 1633 1649-1653 1674-1682 
1691-1692 1694-1698 1708 1710-1714 1727 
1737-1739 1765-1769 1773-1774 1780-1783 
1805-1806 1839-1844 1852 1870-1875 1882- 
1885 1896 1925 1964-1965 1978-1979 1990- 
1991 1993-1998 2031-2034 2058-2059 2084 
2109-2110 2118 2173 2186-2187 2193-2195
2202-2203 2208-2210 2233 2254 2278 2288- 
2291 2305-2306 2414-2415 2496 2511-2513
2537-2538 2558-2559 2573 2579-2584 2590 
2597 2604 2629-2631 2639-2642 2659-2662 
2672-2674 2696-2697 2719-2720 2741-2743 
2751 2762 2777-2778 2782 2787-2789 2792 
2843 2865 2886-2887 2945-2946 2951-2952 
2961-2962 3035-3036 3082 3109-3110 3149 
3195 3199 3205 3212 3310-3311 3339-3340 
3411-3417 3424-3427 3444-3445 3498-3501
3516-3517 3577-3580 3629-3630 3632-3634 
3638-3640 3658-3660 3684 3693 3714 3716 
3871-3872 3930 4004 4019 4082-4084 4098 
4100 4104-4105 4126 4145-4153 4241-4242 
4520 4549 4569 4616-4621 4672-4674 4761- 
4763 4861 5137-5140 5148-5149 5298-5302 
5310-5311 5328 5408-5410 5485 

fetal heart 

Invitrogen 

FHR001 

909 1089-1091 1128 1256 1514 1621-1623 
3354 4228 

fetal 
kidney 

Clontech 

FKD001 

30-31 137 154-156 212 278-283 313-315 
326-327 370-371 379-380 491-492 551-553 
595 602-604 665-667 680-681 736-740 743- 
744 822-823 900-902 950-956 995 1023-1025 
1035 1085 1089-1091 1182-1184 1230-1232 
1300-1301 1332-1333 1353-1354 1357-1359 
1386-1387 1446-1449 1457-1462 1479 1515 
1532 1551-1552 1580-1581 1588-1589 1612 
1617-1619 1629-1632 1663 1667-1670 1719- 
1721 1724 1746 1752-1754 1796-1798 1831- 
1834 1845-1847 1896-1897 1925-1927 1951 
1981-1983 1993-1998 2035 2045-2047 2111-
2114 2118 2144 2224-2228 2253 2360 2422
2440-2441 2502 2510 2526-2527 2549-2550 
2645-2647 2650 2693 2763 2774-2776 2781 
2831-2832 2844-2845 2879-2881 2898 2913 
2960 2974-2975 2979 3031-3032 3054 3198 
3230 3276-3277 3304-3307 3372-3373 3442 
3446-3449 3491-3495 3536-3543 3714 3780- 
3781 3853 4030-4031 4055-4056 4093 4581- 
4582 4679 4864-4865 4907-4910 5001-5003 
5038-5039 5050-5052 5142-5143 5148-5149 
5329-5330 5372-5374 

fetal 
kidney 

Clontech 

FKD002 

313-315 551-553 699-701 743-744 784-786 
1017-1021 1173 1182-1184 1403-1404 1753 
2055-2056 2116 2118 2223 2253 4598-4601
4907-4909 5001-5003 

fetal 
kidney 

Invitrogen 

FKD007 

45-46 491-492 849-851 950-953 1507 1575- 
1576 1746 2060-2061 2086-2087 2134-2136 
2204-2207 2223-2228 2380-2381 2579-2584 
3242 4581-4582 

fetal lung 

Clontech 

FLG001 

64 350-352 453-455 551-553 736-740 822- 
823 863-869 997-998 1000-1002 1035 1042- 
1043 1193 1275-1277 1317-1318 1374 1648 
1674-1682 1707 1727 1746 1753 1830 1835- 
1844 1852 1870-1875 1951 2004 2066-2071 


2134-2136 2148-2150 2253 2276 2386 2397- 
2399 2452-2454 2522 2645-2647 2657-2658 
2677-2680 2713-2714 2739 2774-2776 2831- 
2832 2866-2867 2873-2874 3013-3014 3100 
3535 3548-3549 3666 3686-3688 4126 4136 
4569 5188-5189 5389-5391 

fetal lung 

Invitrogen 

FLG003 

195-196 278-283 341-344 388-390 395 
450-451 491-492 849-851 879-881 885-886 
950 971-979 995 1128 1193 1237-1242 1269- 
1270 1386-1387 1450 1507 1514 1605-1608 
1709 1746 1780-1783 1830 1835-1838 1848- 
1852 1860 1865-1875 1990-1991 2010-2013 
2060-2061 2072-2074 2094 2174-2176 2233 
2253 2283 2526-2527 2579-2584 2594-2595 
2836-2839 2844-2845 2888 2896-2897 2913 
2951-2952 2979 2985 3008 3123 3149 3200- 
3202 3212 3258-3259 3280-3282 3466-3468 
3508-3513 3536-3543 3605 3629-3630 3691
3749-3752 3793 3840-3841 4016-4018 4114- 
4116 4258 4330-4331 4598-4601 4897 5102- 
5104 5119 5276-5277 

fetal lung 

Clontech 

FLG004 

154-156 971-974 1070-1071 1182-1184 
1527-1531 1701-1702 1753 1896 3462 3629- 
3630 5001-5003 5241-5242 

fetal 
liver- 
spleen 

Columbia 
University 

FLS001 

1-13 24-27 29-50 52-99 111-113 115 126 
133-134 136 140-142 154-156 166-192 195- 
222 227-230 232-236 241-283 286-287 291 
307-310 313-327 330-334 336-361 365-367
369-375 379-383 386-394 396-420 422-431 
435.446 453-456 461 474-475 478-481 483- 
505 507-532 534-545 548-553 557 561-562 
565-567 569-577 580-581 583-607 611-620 
629-631 633-650 652-653 655-662 682-684 
699-701 704-706 709-711 724-727 736-740 
743-744 747-750 755-759 773-829 832-835
839-854 857-877 882-886 889-905 909-921 
924 927-966 968-969 971-988 990-995 997- 
1014 1017-1050 1052-1055 1058-1059 1063- 
1074 1076 1078-1082 1085-1088 1092 1094 
1096-1104 1107-1108 1110-1113 1115-1121 
1124 1127-1145 1148 1150-1175 1177-1223 
1225-1256 1263-1289 1292-1301 1307-1327 
1332-1335 1337-1345 1349-1350 1352-1355 
1357-1371 1374-1379 1386-1387 1389-1393 
1395-1397 1400-1401 1403-1406 1408-1410 
1414-1423 1425 1432 1434 1437-1438 1440- 
1444 1446-1462 1467-1473 1479-1480 1485- 
1486 1495-1500 1504 1507-1510 1513-1514 
1518-1519 1522-1524 1527-1538 1540-1548 
1551-1557 1559-1576 1579-1608 1611-1623 


1625-1682 1686 1688-1740 1742 1746-1756 
1759-1806 1814-1819 1822-1830 1835-1852 
1857-1860 1865-1887 1894-1895 1897-1927 
1948-1974 1978-1984 1988-2030 2036-2037 
2042-2043 2045-2048 2052-2056 2058-2064 
2078-2079 2084 2086-2087 2095 2098-2102 
2111-2114 2116 2118 2134-2136 2138-2150 
2153-2184 2186-2213 2216-2257 2262 2264- 
2273 2275-2307 2310-2323 2326-2368 2376- 
2381 2383-2386 2388 2395 2405-2413 2419- 
2420 2440-2441 2449 2452-2454 2456-2457 
2465 2467 2469-2480 2482-2504 2506 2509- 
2520 2524-2544 2546-2562 2564-2590 2592 
2594-2596 2600-2616 2626-2697 2706-2708 
2712-2718 2721 2726 2728-2732 2748 2752- 
2758 2761-2762 2764 2774-2776 2784 2787- 
2789 2796-2798 2800-2801 2803 2807 2816- 
2819 2821-2839 2842-2862 2864 2868-2875 
2879-2885 2888-2893 2896-2897 2899-2900 
2904-2906 2909-2922 2925-2939 2942-2943 
2945-2947 2951-2955 2957 2959-2963 2969- 
2976 2978-2979 2981-2984 2986-2992 2995- 
2999 3001-3014 3016-3037 3041-3049 3054- 
3059 3065-3072 3074-3137 3139-3147 3150-
3168 3172-3174 3181-3183 3195 3197 3200- 
3202 3210-3212 3215-3218 3220-3222 3224- 
3225 3240-3241 3253-3255 3258-3267 3270- 
3295 3297-3340 3342-3344 3347-3348 3353- 
3360 3363-3368 3370-3382 3384-3422 3424- 
3432 3435-3450 3458-3460 3462-3469 3472- 
3501 3505-3519 3522-3532 3535-3556 3561- 
3565 3572 3574-3576 3588 3590-3593 3605 
3621-3623 3628-3641 3650-3652 3654-3685 
3691-3693 3697-3705 3707-3716 3723-3726 
3729-3730 3735 3737-3739 3741-3782 3791- 
3793 3804-3806 3811-3816 3822-3824 3826 
3835 3845 3847-3849 3856-3857 3860-3880 
3885-3896 3906 3908 3911-3919 3924-3951
3955-3960 3962-3972 3997-4000 4002-4018 
4020 4024-4032 4035-4056 4059-4081 4090- 
4100 4104-4105 4114-4116 4136 4144-4153
4157-4158 4167-4168 4209-4210 4219-4223 
4228 4239-4244 4248 4250-4258 4260-4295 
4297-4299 4320-4326 4329-4339 4341 4343- 
4351 4353 4355-4356 4358-4359 4363 4367- 
4384 4386-4394 4397 4399-4400 4402-4412 
4414 4419-4465 4467-4470 4472-4476 4479- 
4489 4537-4541 4554-4555 4566 4570-4571 
4575-4577 4581-4586 4598-4604 4606 4611-
4621 4624-4644 4648-4659 4661-4675 4677- 
4678 4684-4689 4691-4696 4698 4701-4709
4729-4760 4764-4766 4768-4771 4777-4780 
4796-4797 4807-4808 4811-4818 4820-4836
4838-4839 4841 4843-4848 4852-4853 4855- 
4858 4861 4864-4865 4867-4879 4881-4891 
4893-4925 4963 4976-4979 4987-5013 5019- 
5041 5043-5065 5071-5073 5075-5088 5092- 
5104 5107-5111 5114-5115 5117-5118 5120- 
5121 5123-5128 5130-5131 5137-5140 5145 
5147-5152 5154-5162 5164-5176 5180-5197 
5201 5208 5210-5236 5240-5243 5246-5247 
5250-5251 5268-5269 5276-5277 5295-5296 
5298-5302 5310-5311 5317-5320 5323-5325 
5335-5343 5346-5354 5372-5373 5384 5386- 
5391 5393-5398 5402-5410 5413-5415 5417- 
5420 5423-5462 5467-5476 5482-5483 5485- 
5497 

fetal 
liver- 
spleen 

Columbia 
University 

FLS002 

3-11 24-27 35 38 42 44 48-50 57 67 70 73-77
85 88 98 107-111 136-142 151-153 165 173 
195-196 198 201-205 210-215 219222 232- 
234 236 245 252-254 258-266 277 291 316-
320 332-333 337 354 357-361 365 374-375 
381-383 394 406 415-416 418 436-438 445- 
446 461 478-480 486 489-490 520-521 527 
538 540 543 548-553 574-577 599-601 607 
612-620 647-649 677-678 682-685 699-706 
709-711 736-740 747-750 755-759 777 788-
793 814-816 818 822-828 833 852-854 863- 
869 873-877 885-886 889-897 899-902 906- 
914 916-920 924 927-936 946-949 951-956 
961-962 969 975-988 990-991 993-995 999- 
1014 1023-1037 1041-1047 1052 1055 1063- 
1067 1070-1071 1076 1080 1085 1088 1108 
1110-1119 1124 1128-1142 1144-1145 1148 
1151-1156 1158 1160-1170 1172-1175 1177- 
1184 1186-1187 1192-1193 1195-1197 1202- 
1204 1208-1212 1215-1218 1220-1221 1225- 
1227 1235-1236 1244-1246 1249-1256 1263 
1266-1273 1278-1280 1285-1291 1297-1301 
1307 1315-1316 1320-1327 1332-1333 1349- 
1350 1352-1355 1357-1371 1374-1379 1385- 
1387 1389 1395-1397 1405-1406 1410 1414- 
1417 1421-1423 1425 1427-1432 1437 1442- 
1444 1451-1456 1463-1464 1470-1473 1475- 
1477 1479-1480 1485 1498-1499 1515 1536- 
1538 1540-1546 1550-1557 1559-1560 1580- 
1585 1597 1603-1608 1612-1616 1620 1625- 
1627 1629-1632 1638-1653 1656 1661-1662 
1664 1667-1682 1685 1691-1692 1694-1699 
1701-1704 1706-1707 1709-1714 1717 1719- 
1721 1725-1734 1737-1739 1747-1751 1753-
1756 1759-1769 1772 1775-1795 1799-1813 
1819 1827-1844 1852 1857-1859 1865-1875 
1879-1881 1888-1893 1896 1900-1902 1913- 
1918 1926-1927 1948-1950 1952-1962 1966- 
1974 1988-1992 2000-2007 2010-2020 2025- 
2030 2040-2044 2048-2054 2062-2064 2068- 
2071 2080-2082 2086-2091 2095 2107 2111- 
2116 2118 2121-2133 2143 2145-2150 2153-
2160 2171 2179-2181 2188-2190 2196-2201 
2204-2210 2216-2221 2223-2229 2231-2233 
2244-2248 2251-2252 2254-2257 2261-2262 
2266 2271 2275-2277 2281-2284 2286-2293 
2298-2299 2301-2302 2305-2306 2311-2312 
2314-2320 2326-2332 2336-2340 2342-2356 
2358-2361 2364-2368 2380-2381 2385 2388- 
2391 2395-2399 2414-2415 2421 2427 2440- 
2441 2449 2452-2454 2456 2459-2460 2465 
2467-2468 2473-2475 2477-2479 2482-2491 
2497-2504 2511-2518 2521-2522 2524-2525
2528-2535 2537-2538 2542 2546 2548 2552 
2554-2571 2573-2578 2586 2592 2594-2595 
2604 2624 2626-2627 2629-2632 2637 2643 
2645-2648 2656 2659-2662 2665-2680 2686- 
2692 2694-2695 2706 2709-2718 2726 2731- 
2732 2734-2735 2741-2743 2747-2750 2754- 
2759 2761 2774-2776 2781 2787-2789 2793 
2800-2801 2808 2813 2818 2824-2826 2834- 
2842 2846-2847 2854-2862 2865 2869-2874 
2883-2889 2892 2896-2897 2899-2900 2914- 
2917 2921-2922 2925-2929 2951-2952 2954 
2957 2959 2961-2962 2974-2975 2978-2979 
2995 3001-3005 3007-3009 3013-3014 3017- 
3023 3025-3027 3029 3041-3049 3056-3059 
3065-3068 3071 3075-3079 3082 3088 3093- 
3095 3098-3100 3102-3105 3117-3118 3122 
3132 3139-3148 3150-3153 3167-3168 3172- 
3174 3195 3197 3199-3202 3213-3216 3220- 
3222 3224-3225 3229 3237 3240 3243 3269- 
3273 3276-3277 3283-3284 3288 3291 3294 
3297-3299 3301 3304-3311 3313 3316-3322 
3326-3330 3333-3338 3342-3344 3348 3355 
3359 3372-3373 3386 3390 3394-3398 3402 
3409-3410 3418 3424-3429 3431-3432 3435- 
3436 3438-3441 3456 3459-3460 3466-3468 
3477-3478 3480 3482 3484-3489 3491-3495 
3498-3501 3519 3523-3532 3535-3556 3561- 
3562 3574-3575 3577-3580 3583-3585 3590 
3608 3610-3613 3616-3617 3621-3625 3628- 
3630 3638-3641 3658-3660 3663 3672-3677 
3682 3691 3693 3697 3699 3704-3705 3707-
3708 3714-3716 3723-3725 3735-3736 3745- 
3747 3749-3752 3754 3758-3762 3765 3768 
3770-3773 3775-3777 3782 3784-3786 3792 
3808 3822-3824 3826 3828 3834-3835 3846 
3853 3856-3857 3864-3865 3870 3875 3878- 
3883 3886 3889 3894 3897-3906 3908 3911-
3919 3925-3927 3931-3933 3935-3936 3955- 
3958 3963-3971 3973-3983 4014-4015 4021 
4023 4027-4029 4040-4045 4048-4052 4060- 
4064 4067 4076-4080 4085-4086 4093 4096- 
4098 4100 4104-4105 4109-4111 4114-4116
4121-4122 4136 4157-4158 4220-4223 4227- 
4228 4232-4235 4243-4244 4250-4253 4255- 
4257 4260 4273-4277 4289-4295 4322 4342 
4348 4359 4376-4377 4396 4410-4412 4430- 
4432 4442 4450-4452 4456 4461-4464 4468 
4533 4569-4571 4577 4581-4582 4613 4616- 
4621 4629-4632 4638 4640 4643 4649 4651- 
4653 4657 4659 4664-4665 4693-4694 4701- 
4709 4729 4731-4732 4737-4741 4744-4746 
4750 4764-4766 4768-4771 4777-4780 4785- 
4786 4807-4808 4820-4821 4824-4826 4832- 
4834 4836 4854 4856 4859-4860 4872 4874- 
4875 4882-4883 4885-4886 4888-4889 4896 
4898 4904-4909 4926 4953 4967-4970 4998- 
4999 5034-5036 5055-5056 5073 5075-5078 
5082-5083 5095-5099 5107-5108 5123-5124 
5137-5140 5145 5166-5167 5169 5174-5175 
5180 5188-5192 5208 5222-5224 5227 5244- 
5245 5323-5325 5372-5373 5386-5391 5393 
5399-5403 5414-5418 5427-5429 5431-5433 
5440-5441 5448-5451 5455 5458 5467-5476 
5489 

fetal 
liver- 
spleen 

Columbia 
University 

FLS003 

210-211 341-344 849-851 1089-1091 1177- 
1179 1310-1314 1320-1321 1349-1350 1440- 
1441 1514 1557 1624 1648 2042-2043 2134- 
2136 2223 2253-2254 2511-2513 2533 2843
2979 4163-4166 4273-4274 4687-4689 4738- 
4739 4998-4999 5075-5077 5414-5415 5452- 
5453 

fetal liver 

Invitrogen 

FLV001 

3-11 52 246-247 255-260 278-283 291 341- 
344 491-492 596 652-653 709-711 724-727 
778-783 814-816 840-844 849-851 882-886 
903-905 946-949 964-966 971-988 997-998 
1003-1006 1010-1014 1026-1027 1038-1040 
1044-1047 1063-1068 1070-1071 1089-1091 
1137-1138 1143 1171 1177-1179 1182-1184 
1193 1198-1201 1205-1207 1310-1314 1317- 
1318 1320-1321 1337-1345 1349-1350 1360- 
1362 1414 1421 1425 1446-1449 1470 1479
1488 1507 1513-1514 1540 1547-1548 1550 
1575-1576 1603-1604 1621-1623 1626 1694- 
1698 1703-1704 1708-1714 1717 1743-1746 
1773-1779 1796-1804 1835-1844 1860 1870- 
1875 1888-1893 1913-1914 1925-1927 1954- 
1962 1978-1979 1981-1983 2101-2102 2111- 
2114 2118 2134-2136 2146-2147 2161-2162
2174-2176 2184 2186-2188 2200-2201 2233 
2253 2258-2260 2276 2284 2310 2338-2340 
2380-2381 2388 2526-2527 2629-2631 2672- 
2676 2800-2801 2829 2857-2860 2864 2879- 
2882 2951-2952 2957 2976 2979 2992 3246 
3253-3255 3288 3297-3298 3328-3332 3391 
3488 3561-3562 3624-3625 3632-3634 3638- 
3640 3661-3662 3760 3784-3786 3828 3847- 
3849 4004 4036-4039 4060-4061 4068-4069 
4209-4210 4241-4242 4275-4277 4291-4295 
4335-4337 4391-4393 4410-4412 4451-4452 
4477 4616-4621 4633-4634 4636-4637 4651 
4672-4674 4687-4689 4729 4738-4739 4754- 
4755 4796-4797 5075-5077 5082-5083 5102- 
5104 5130-5131 5148-5149 5241-5242 5310- 
5311 5383 5387-5388 5452-5453 5467 5485 

fetal liver 

Clontech

FLV002 

1411 1605-1608 1625 4581-4582 5323-5325 

fetal liver 

Clontech 

FLV004 

210-211 258-260 313-315 652-653 852-854
863-869 971-974 1116 1128 1171 1177-1179
1182-1184 1192 1205-1207 1374 1422-1423 
1605-1608 1613-1616 1626 1648 1701-1702 
1725-1726 1775-1779 1913-1914 2116 2134- 
2136 2188 2223 2253 2484-2487 2677-2680 
3076-3078 3328-3330 4451-4452 4777-4780 
4998-4999 5050-5052 5082-5083 5310-5311 
5387-5388 5414-5415 

fetal 
muscle 

Invitrogen 

FMS001 

3-11 154-156 216-218 278-283 313-315 341- 
344 388-390 395 478 491-495 511 591 652-
653 704-706 814-816 822-823 889-897 903- 
908 925-926 928-929 931-936 946-950 957- 
960 980-988 993 1017-1021 1048-1050 1063-
1068 1171 1297-1299 1307 1320-1321 1359 
1444 1507-1510 1514 1533-1535 1540 1553- 
1556 1585 1605-1608 1639 1694-1698 1710- 
1714 1717 1746 1753 1773-1774 1780-1783 
1805-1813 1860 1879-1885 1915-1918 2004 
2042-2043 2107 2118 2134-2136 2148-2150 
2161-2162 2197-2199 2254-2257 2281-2282 
2503-2504 2665 2686 2824-2826 2843 2852- 
2853 2896-2897 2979 2985 3054 3058-3059 
3159-3162 3213-3214 3226-3228 3280-3282 
3299 3323-3324 3365-3366 3372-3373 3658- 
3660 4391-4393 4410-4412 4558 4560 4613
4622-4623 4635 4796-4797 4862-4865 4870 
4904 4907-4909 4967-4970 5025 5137-5140 
5387-5388 5393 

fetal 
muscle 

Invitrogen 

FMS002 

341-344 652-653 1298-1299 1389 1400-1401 
1507 1727 1746 1753 2042-2043 2191-2192 
2224-2228 2761 2979 3123 3257 4285-4286 
4410-4412 

fetal skin 

Invitrogen 

FSK001 

1-2 39-40 70 92-95 137 157-159 175-177 
213-215 246-247 278-283 291 298-300 313- 
315 341-344 365 370-371 388-390 419 445- 
446 452 478-480 511 516 522-524 538-539
548-553 580-581 597-598 602-603 633-634 
647-649 652-653 677-678 685 709-711 784-
786 789-793 814-816 824-829 849-851 863- 
870 879-884 903-905 909 919-920 925-926 
946-949 957-960 980-988 992-994 997-1002 
1010-1014 1017-1021 1035 1042-1047 1050- 
1051 1076 1078 1110-1113 1117-1119 1129- 
1130 1151-1155 1160-1164 1182-1184 1198- 
1204 1237-1243 1256 1271 1290-1291 1307 
1310-1314 1320-1321 1323-1327 1351 1355 
1357-1359 1380 1385 1390-1393 1400-1401 
1414 1418-1420 1432 1435-1436 1450 1457- 
1462 1479 1488-1489 1507-1510 1524 1533- 
1535 1547-1548 1550-1552 1567-1568 1575- 
1576 1579 1585 1588-1589 1611 1617-1619 
1621-1623 1653-1655 1663 1686 1688-1689 
1691-1692 1694-1698 1703-1704 1710-1714 
1743-1746 1753 1765-1771 1773-1774 1780- 
1783 1807-1813 1830-1834 1848-1852 1865- 
1878 1882-1885 1903-1911 1915-1918 1925- 
1927 1954-1962 1964-1965 1981-1983 1990- 
1991 2006-2007 2017-2030 2038 2054 2068- 
2071 2076-2079 2088-2091 2098-2100 2107 
2118 2145 2153-2155 2173 2177 2179-2181 
2188 2191-2192 2204-2210 2214-2215 2246- 
2248 2251-2253 2267-2271 2277 2280 2286- 
2291 2305-2306 2310 2338-2340 2376 2386 
2432 2434-2435 2437 2469 2483 2490-2491 
2510-2513 2526-2527 2560 2563 2572-2573 
2588-2589 2594-2595 2603 2628 2659-2662 
2696-2697 2734 2741-2743 2754-2758 2782 
2787-2789 2813 2819 2824-2826 2828 2831- 
2832 2843-2845 2855-2860 2865 2873-2874 
2905 2914-2917 2925-2929 2945-2946 2951- 
2952 2955 2961-2962 2965-2975 2979 2981- 
2983 2985 2989 2996-2998 3000 3008 3023 
3082 3109-3110 3151-3153 3156-3157 3167-
3168 3195 3213-3216 3220-3222 3234 3247-
3248 3253-3255 3258-3259 3280-3282 3288-
3290 3299 3310-3311 3313 3354 3396-3398 
3406-3407 3424-3427 3442 3451-3455 3462 
3472-3473 3536-3545 3550-3552 3576 3595- 
3596 3599-3601 3624-3625 3627 3632-3634 
3658-3660 3691 3714 3716 3760 3791-3792 
3815-3816 3871-3872 3897-3905 3908 3925- 
3926 3941-3942 3959-3960 3966-3968 4068- 
4069 4071-4075 4077-4079 4082-4086 4100 
4108 4123 4131 4137 4209-4210 4228 4272 
4275-4277 4330-4331 4335-4337 4376-4377 
4379-4380 4410-4412 4537-4541 4558 4581- 
4582 4652-4653 4672-4674 4691-4692 4740- 
4741 4764-4766 4776 4796-4797 4804-4805 
4815 4832-4834 4864-4865 4907-4910 4921 
4939-4940 4967-4970 5095-5097 5137-5140 
5148-5149 5154-5157 5235 5310-5311 5329- 
5330 5367-5368 5423-5425 5427-5429 5431- 
5433 

fetal skin 

Invitrogen 

FSK002 

313-315 341-344 366-367 551-553 971-974 
1038-1040 1151-1153 1271 1353-1354 1507-
1510 1588-1589 1755-1756 1870-1875 1903- 
1911 1926-1927 1952-1953 2017-2020 2027-
2030 2078-2079 2197-2199 2377-2379 2669- 
2671 2677-2680 2931-2939 3167-3168 3189- 
3191 4082-4084 4613 4907-4909 5423-5425 

fetal 
spleen 

BioChain 

FSP001 

175-177 743-744 1171 1202-1204 1457-1462 
1753 2060-2061 2116 2143 2223 2253 2728- 
2729 4167-4168 5001-5003 5335-5343 5444 

umbilical 
cord 

BioChain 

FUC001 

1-2 30-31 47 52 70 74-77 100-106 154-156 
191 201-205 258-260 267-268 270-272 277- 
283 298-303 341-344 370-371 379-380 388- 
390 445-446 457 461 478-480 491-492 557 
592-594 605-606 652-653 677-678 743-744 
814-816 822-823 847 849-851 863-869 903- 
905 909-914 919-920 944 950 957-960 970-
974 980-988 994-995 1038-1040 1042-1043 
1049 1052 1069-1071 1075 1108 1117-1119 
1128-1130 1151-1154 1160-1170 1173 1182-
1184 1189-1191 1198-1207 1220-1221 1230- 
1232 1256 1269-1270 1294-1301 1320-1321 
1349-1350 1355 1357-1358 1363-1371 1374 
1389 1397 1410 1422-1423 1439-1443 1450- 
1454 1470 1473 1479 1486 1488 1497-1499 
1507-1510 1514 1522-1523 1532-1535 1551- 
1552 1569-1574 1577-1578 1599-1602 1611 
1620 1648-1652 1661-1663 1671-1673 1688- 
1689 1694-1699 1703-1704 1706-1707 1710- 
1716 1719-1721 1724 1737-1739 1746 1753 
1755-1756 1773-1774 1780-1783 1796-1798 
1831-1838 1848-1851 1860 1870-1875 1882-
1885 1903-1912 1915-1918 1925 1951 1990- 
1991 2000-2002 2021-2024 2027-2034 2038 
2042-2043 2045-2047 2050-2051 2054-2056 
2060-2061 2080-2082 2086-2087 2096-2100 
2118 2125-2130 2134-2136 2144 2146-2147 
2156-2160 2163-2168 2170 2177 2193-2195
2202-2207 2223-2230 2234-2242 2251-2253 
2267-2270 2273 2275 2278 2284 2294-2295 
2298-2299 2318-2320 2347-2348 2386 2392 
2404 2417-2418 2422 2425 2431 2440-2441 
2452-2454 2465 2499-2501 2510 2519-2520 
2528-2531 2536 2546 2558-2562 2574-2575 
2579-2584 2592 2594-2595 2645-2647 2650 
2659-2662 2686 2696-2697 2702-2706 2762- 
2763 2768-2771 2774-2778 2781 2800-2801 
2803 2805-2807 2820 2824-2829 2831-2832 
2843 2846-2847 2854 2865 2894-2895 2905 
2974-2976 2979 2981-2984 3008 3011-3014
3084-3085 3100 3139-3147 3159-3162 3167- 
3168 3172-3173 3181-3183 3189-3191 3204 
3212-3214 3218 3226-3228 3243 3253-3255
3260-3267 3280-3282 3299 3304-3307 3310- 
3311 3313-3314 3331-3335 3342-3344 3365- 
3366 3402-3404 3409-3410 3418 3428-3429 
3451-3455 3466-3468 3486-3487 3491-3495 
3522 3577-3580 3595-3596 3616-3617 3619- 
3620 3624-3625 3628 3635 3638-3640 3663 
3691 3723 3735 3780-3781 3792 3795-3800 
3830 3847-3849 3853 3870 3882-3883 3955- 
3957 3959-3960 3969-3971 4001 4005-4007 
4014-4015 4043-4045 4060-4064 4098 4126 
4140-4141 4167-4169 4209-4210 4239-4242 
4267-4270 4326 4342 4391-4393 4410-4412 
4581-4582 4624-4626 4629-4634 4638 4693- 
4694 4731-4732 4740-4741 4748-4749 4756- 
4757 4785 4796-4797 4807-4808 4864-4865 
4886 4907-4910 4994-4995 5095-5097 5142- 
5143 5154-5157 5177-5179 5222-5224 5310- 
5311 5367-5368 5387-5388 

fetal 
brain 

GIBCO 

HFB001 

3-11 32-34 39-40 42 78 81-82 100-110 116-
119 124-142 154-156 165 175-177 195-196 
201-205 212-218 220 278-283 286-287 291- 
296 313-315 335 341-344 346-349 366-367 
379-380 388-390 396-398 419 456-461 491- 
492 511 551-553 557 561-562 574-577 583- 
590 651-653 676-679 682-694 697-711 743- 
744 784-786 804-805 814-816 822-825 848- 
851 855-859 863-869 871-872 882-884 899- 
902 915-918 927 930-936 944-945 951-953 
957-962 964-966 969 971-988 990-995 997-
1002 1008-1009 1017-1027 1036-1040 1042- 
1043 1049-1050 1053-1054 1069-1071 1076 
1079 1085 1095 1110-1113 1117-1119 1127- 
1130 1143 1154-1156 1159 1173 1175 1182- 
1184 1193 1198-1207 1228-1232 1235-1236 
1281-1282 1287 1290-1306 1315-1316 1320- 
1321 1328 1349-1350 1352 1355 1357-1362 
1385-1417 1421-1441 1444-1445 1450 1455- 
1464 1467-1485 1498-1499 1507 1515-1517 
1520 1524 1536-1538 1540 1547-1548 1551- 
1552 1586-1587 1605-1608 1617-1620 1625 
1634-1637 1639 1649-1653 1656 1663 1665 
1667-1673 1691-1692 1694-1698 1701-1702 
1706-1708 1710-1714 1719-1721 1723-1724 
1728-1739 1743-1746 1753-1756 1758 1763- 
1771 1773-1774 1780-1790 1792-1798 1805- 
1806 1814-1817 1830-1838 1852 1860 1870- 
1877 1879-1881 1903-1911 1915-1918 1926- 
1927 1948-1951 1954-1962 1964-1974 1978- 
1979 1981-1983 1993-1998 2000-2002 2010- 
2013 2021-2038 2042-2043 2045-2048 2052- 
2082 2084-2100 2111-2114 2118 2131-2136
2144-2147 2153-2160 2174-2177 2179-2181 
2186-2188 2204-2210 2214-2215 2223-2228 
2230 2234-2242 2254-2260 2266 2273 2275
2278 2296-2299 2310 2331 2333-2334 2341 
2380-2382 2391-2392 2403-2435 2437-2447 
2450 2456 2467 2469 2473-2474 2483 2488 
2499 2502 2514-2516 2519-2520 2522-2523 
2536 2549-2550 2573-2575 2586 2592 2596 
2598 2603-2604 2628 2644-2648 2650 2659- 
2662 2677-2680 2700 2702-2706 2726 2745- 
2765 2768-2789 2791-2792 2820 2824-2826 
2843 2855-2860 2865 2894-2895 2898 2945- 
2946 2954 2964-2973 2978 2981-2983 3017- 
3020 3026-3027 3084-3085 3100 3102-3105 
3110 3130 3136-3162 3167-3168 3170-3174 
3181-3183 3189-3191 3195-3197 3199-3234
3240 3242 3247-3248 3253-3255 3257 3280- 
3282 3285 3289-3290 3299 3302 3310-3311
3326-3327 3339-3340 3372-3373 3394-3395 
3424-3427 3446-3449 3457 3519 3522 3524- 
3532 3544-3545 3553-3556 3558-3560 3574- 
3597 3599-3601 3616-3617 3619-3620 3628- 
3630 3638-3640 3680 3689 3693 3701 3716 
3729-3730 3760 3778 3784-3786 3791 3810- 
3828 3830-3835 3864-3865 3882-3884 3891 
3911-3919 3925-3926 3928 3941-3942 3955-
3957 3972 4004 4023 4027 4043-4045 4055- 


4056 4062-4064 4071-4075 4077-4079 4082-
4093 4098 4100-4102 4114-4116 4121-4134 
4136-4144 4157-4158 4220-4223 4230-4235 
4245 4252-4253 4285-4288 4291-4295 4322 
4379-4380 4477-4478 4534 4537-4552 4554- 
4571 4598-4601 4616-4623 4691-4694 4740- 
4741 4781 4783-4788 4790-4791 4864-4865 
4873 4907-4910 4946-4947 4950-4960 5000- 
5003 5085-5088 5123-5124 5137-5140 5142- 
5143 5148-5149 5151-5152 5163 5190-5192 
5222-5224 5278-5292 5294-5306 5329-5330 
5346-5354 5367-5368 5387-5391 5416 5427- 
5429 5485 5490-5491 5497 

macrophage

Invitrogen 

HMP001 

244 278-283 440-441 445-446 794-795 855-
856 995 999 1017-1021 1353-1354 1507 
1582-1584 2223 4228 4864-4865 5490-5491 

infant 
brain 

Columbia 
University 

IB2002 

32-35 39-40 45-47 64 70 74-77 81-82 92-95 
100-110 116-119 124 126 136 154-156 175- 
177 180-182 195-196 213-218 227-230 246- 
247 254 278-283 291 296 340 346-352 362 
364-365 388-390 413-414 419 445-446 459 
491-492 509 511 551-553 574-577 579-590 
592-594 607 652-653 675-676 680-681 743- 
744 755-757 789-793 796 806-808 824-825 
832 849-851 855-859 863-872 900-918 924 
927 944 951-956 964-966 971-988 990-995 
997-998 1008-1009 1022 1026-1027 1036- 
1040 1042-1043 1049-1054 1069-1071 1088- 
1091 1110-1113 1117-1121 1127 1129-1130 
1139-1143 1154-1155 1159 1172-1173 1175 
1180-1181 1192-1193 1198-1207 1217-1218 
1220-1221 1230-1232 1235-1236 1256 1263 
1274 1281-1282 1290-1291 1297 1300-1301 
1307 1315-1316 1319-1321 1328 1334-1335 
1349-1350 1357-1359 1363-1371 1394-1399 
1402-1404 1410-1411 1413-1420 1422-1424 
1427-1431 1437 1439-1441 1444 1451-1462 
1465-1470 1479 1485 1498-1499 1507-1510 
1540 1547-1548 1550-1552 1580-1584 1586- 
1587 1592 1603-1608 1617-1620 1638-1639 
1646-1648 1653 1656 1664-1673 1693-1699 
1719-1721 1727-1734 1737-1739 1743-1745 
1752-1756 1763-1769 1773-1774 1780-1783 
1805-1806 1814-1817 1830-1834 1848-1852 
1865-1885 1896-1897 1899 1903-1911 1926- 
1927 1951-1962 1964-1974 1978-1979 1990- 
1991 2000-2003 2010-2013 2017-2020 2025-
2030 2052-2056 2058-2061 2066-2067 2092 
2098-2100 2131-2133 2138-2144 2151-2152 
2161-2162 2171 2177 2186-2190 2200-2201 


2208-2210 2214-2215 2224-2228 2244-2245 
2249 2251-2253 2273 2288-2291 2296-2299 
2303-2304 2310-2312 2333-2334 2361 2386 
2414-2415 2417-2423 2427 2430 2432 2437 
2439 2465 2499 2511-2513 2526-2527 2536
2548-2550 2552 2558-2559 2574-2575 2579- 
2584 2587 2590-2597 2603 2626 2628 2637 
2644 2648 2656 2659-2662 2664 2677-2680 
2686 2692 2702-2705 2734 2741-2743 2745 
2747-2748 2751-2753 2779 2781 2784 2786 
2813 2818-2819 2821-2823 2828 2843 2848- 
2850 2863 2865 2869-2870 2896-2898 2913 
2925-2929 2931-2939 2951-2952 2965-2976 
2978 2981-2983 2996-2998 3009 3017-3020 
3026-3027 3031-3032 3082 3084-3085 3090 
3096-3097 3102-3105 3110 3136-3137 3174
3189-3191 3198-3199 3203 3208-3209 3212- 
3216 3219-3222 3229 3243 3260-3267 3312 
3315 3319-3322 3325 3328-3330 3354 3372- 
3373 3394-3398 3424-3427 3442 3462 3472- 
3473 3477-3478 3505-3506 3514 3524-3532 
3535-3543 3554-3556 3558-3560 3574-3580 
3586-3588 3590 3599-3601 3624-3625 3628 
3638-3640 3658-3663 3668-3669 3690-3691 
3693 3699 3701 3707 3723 3728 3747 3761 
3780-3781 3828 3864-3865 3871-3872 3891 
3911-3919 3955-3958 3966-3974 3997-3998
4027 4030-4031 4040-4045 4048-4051 4055- 
4056 4068-4069 4071-4075 4077-4079 4081- 
4084 4087-4089 4098-4099 4124-4126 4131 
4137-4139 4157-4158 4220-4223 4230-4235 
4275-4277 4291-4295 4322 4329 4335-4337 
4383-4385 4400 4425 4461-4464 4481-4489 
4545-4548 4550 4554-4555 4558 4570-4571 
4581-4582 4628 4633-4634 4636-4637 4652- 
4653 4676 4776 4817 4907-4909 4946 4960 
5000-5003 5046-5047 5075-5077 5095-5097 
5100-5101 5116 5137-5140 5142-5143 5150- 
5152 5154-5157 5184-5187 5234 5241-5242 
5276-5279 5291 5310-5311 5414-5415 5445- 
5447 5452-5453 5485 5497 

infant 
brain 

Columbia 
University 

IB2003 

35 39-40 126 180-182 195-196 244 278-283 
296 341-344 350-352 388-390 400-404 413- 
414 551-553 557 561-562 583-590 675 794- 
796 832 855-856 863-869 900-902 906-908 
944 964-966 969 971-974 980-988 997-999 
1008-1009 1026-1027 1036-1037 1042-1043 
1049 1069 1089-1091 1095 1110-1113 1128- 
1130 1139-1142 1155 1173 1193 1202-1204 
1297 1315-1316 1334-1335 1349-1350 1353- 


1354 1359 1383-1384 1394 1403-1404 1413 
1424 1427-1431 1470 1479 1488 1620 1638 
1640-1645 1648 1656 1685 1688-1689 1691- 
1692 1694-1698 1753 1755-1756 1765-1769 
1799-1804 1830 1848-1853 1870-1875 1896 
1903-1911 1915-1918 1952-1962 1978-1979 
2010-2013 2025-2030 2055-2056 2095-2097 
2144 2146-2147 2151-2152 2174-2176 2188- 
2190 2214-2215 2251-2252 2266 2276 2284 
2305-2306 2310 2313 2324 2365 2380-2381 
2391 2427 2432 2450 2473-2474 2483 2496 
2499 2521 2574-2575 2592 2594-2595 2657- 
2658 2665 2696-2697 2702-2705 2734 2754- 
2758 2774-2776 2813 2829 2842 2905 2931- 
2939 2965-2968 2996-2998 3090 3195 3212 
3215-3216 3220-3222 3224-3225 3257 3310- 
3313 3354 3466-3468 3477-3478 3505-3506 
3536-3543 3553 3576 3595-3596 3614 3638- 
3640 3658-3660 3663 3723 3737-3738 3742- 
3744 3831-3833 3864-3865 3941-3942 4040- 
4042 4093 4098 4124 4138-4139 4230-4231 
4243-4244 4252-4253 4291-4295 4335-4337 
4546-4548 4556-4557 4570-4571 4633-4634 
4784 4817 4907-4909 5001-5003 5046-5047 
5054 5137-5140 5260 

infant 
brain 

Columbia 
University 

IBM002 

133 491-492 682-684 743-744 889-897 1038- 
1040 1427-1431 1474 1580-1581 1633 1903- 
1911 2025-2026 2146-2147 2188 2310 2376 
2726 2787-2789 2904 2992 3212-3214 3224- 
3225 3325 3477-3478 3638-3640 3827 3966- 
3968 4055-4056 4118 4131 4275-4277 4643
5044 

infant 
brain 

Columbia 
University 

IBS001 

153 197 350-352 459 574-577 743-744 826- 
828 871-872 903-905 969 995 997-998 1042- 
1043 1068 1127 1151-1153 1287 1349-1350 
1559-1560 1694-1698 1773-1774 1848-1851 
1978-1979 2054-2056 2111-2114 2143 2200-
2201 2365 2521 2664 2707-2708 2843 2945- 
2946 2978 3260-3267 3424-3427 3599-3601 
3624-3625 3737-3738 3891 4114-4116 4125
4245 4291-4295 4635 4907-4909 

lung, 
fibroblast 

Stratagene 

LFB001 

137 313-315 435 491-492 579 822-823 885- 
886 910-914 944 950 961-962 994-995 997- 
998 1000-1002 1026-1027 1049-1050 1052 
1068 1075 1079 1110-1113 1129-1130 1143 
1172 1182-1185 1192 1202-1204 1266-1267 
1274 1287 1294-1296 1298-1299 1307 1315- 
1318 1351 1374 1395-1396 1400-1401 1407 
1411 1418-1420 1444 1455-1456 1473 1485 
1507 1516-1517 1532-1535 1547-1548 1553- 


1556 1611 1620 1648 1659-1660 1663 1688- 
1689 1691-1692 1701-1702 1708 1710-1714 
1719-1721 1735-1739 1758 1765-1769 1780- 
1783 1796-1798 1860 1915-1918 1925 1978- 
1979 1981-1983 1993-1998 2038-2039 2048 
2050-2054 2060-2061 2107 2118 2144 2156- 
2160 2179-2181 2204-2210 2223 2230 2234- 
2242 2251-2252 2254 2283 2318-2320 2331 
2349-2356 2388 2414-2415 2418 2499 2558- 
2559 2574-2575 2594-2595 2644-2647 2672- 
2674 2726 2772 2774-2776 2824-2826 2843 
2861-2862 2865-2868 2888 2898 2925-2929 
2960 2978 3151-3152 3199 3253-3255 3260-
3267 3280-3282 3333-3335 3394-3395 3424- 
3427 3442 3466-3468 3505-3506 3561-3562 
3590 3619-3620 3628 3632-3634 3638-3640 
3673-3677 3680 3684 3735 3754 3810 3815- 
3816 3825 3840-3841 3853 3870 3882-3883 
3969-3971 4023 4055-4056 4140-4141 4220-
4223 4250-4253 4285-4286 4549-4550 4777- 
4780 4956-4957 4967-4970 5148-5149 5298- 
5302 5308-5309 5497 

lung 
tumor 

Invitrogen 

LGT002 

32-34 42 45-46 67 96 100-106 116-120 136- 
139 153 175-177 179 195-196 213-218 227- 
230 278-283 298-300 313-315 324-325 332- 
333 341-344 346-349 357-361 365 379-380 
393-394 400-404 420 436 474 478 483-485 
491-492 513-514 538 548-550 557 574-577 
580-581 605-607 611 652-653 682-684 699- 
701 709-711 723 728-733 743-744 764-771 
778-783 799-803 840-844 852-854 857-859 
873-875 885-886 889-897 909-920 924 946- 
956 964-966 969 971-979 992-995 997-1006 
1008-1009 1023-1027 1036-1040 1042-1043 
1048-1050 1063-1067 1088-1091 1110-1113 
1117-1119 1128-1130 1139-1143 1151-1153
1155-1156 1160-1164 1172-1173 1185 1193 
1202-1212 1217-1218 1220-1221 1229-1232 
1235-1236 1244-1246 1269-1271 1300-1301 
1315-1318 1323-1328 1330-1333 1349-1355 
1357-1359 1363-1371 1374 1397 1403-1404 
1408-1409 1418-1423 1432 1434 1438 1440- 
1441 1446-1449 1463-1464 1467-1470 1473 
1480 1491-1494 1505 1507-1512 1533-1535 
1541-1548 1550-1557 1569-1574 1585 1591 
1611-1612 1617-1619 1621-1623 1625 1629- 
1632 1639-1645 1648 1654-1655 1657-1660 
1663-1664 1671-1673 1685 1687-1689 1691- 
1692 1694-1698 1706-1707 1710-1716 1719- 
1721 1727-1739 1743-1746 1763-1771 1780- 


1783 1807-1817 1830-1838 1852 1857-1859 
1870-1877 1882-1885 1903-1911 1913-1922 
1925 1948-1953 1964-1965 1981-1983 1992- 
1998 2000-2002 2004 2035 2038 2042-2043 
2048 2055-2056 2058-2061 2068-2071 2076- 
2082 2086-2087 2103-2104 2116 2118 2125-
2133 2138-2145 2156-2160 2163-2168 2178- 
2181 2186-2187 2191-2195 2214-2221 2223- 
2228 2231-2232 2234-2242 2249 2254 2258- 
2260 2271 2278 2281-2283 2288-2291 2302
2310 2315 2331 2333-2334 2338-2340 2377- 
2381 2391 2417-2418 2427 2452-2454 2465 
2468 2470-2472 2488 2497 2499-2502 2511- 
2513 2517 2519-2520 2536-2538 2542 2548 
2560-2562 2573 2579-2584 2604 2608-2616 
2626 2628-2631 2644 2665 2693 2696-2697 
2706 2734 2754-2758 2787-2789 2813 2819 
2836-2839 2843-2845 2848-2850 2854 2861- 
2862 2865 2873-2874 2886-2887 2896-2897 
2901-2903 2919-2920 2925-2929 2931-2939 
2951-2952 2954 2961-2962 2964-2973 2976 
3008-3009 3013-3014 3017-3020 3041-3049 
3081 3100 3102-3105 3138-3147 3150 3156- 
3162 3167-3168 3172-3173 3181-3183 3189- 
3191 3207 3210-3211 3213-3214 3220-3222
3229-3230 3240 3250-3255 3258-3259 3280- 
3283 3285 3289-3290 3310-3314 3323-3324 
3333-3338 3342-3344 3374-3382 3391 3396- 
3398 3403-3404 3422 3424-3427 3442-3443 
3462 3466-3468 3505-3506 3516-3517 3536- 
3543 3550-3552 3554-3556 3576 3590 3624- 
3625 3628-3630 3632-3634 3638-3640 3661- 
3662 3667 3673-3677 3684 3707 3723 3735 
3760-3761 3778 3780-3781 3787-3788 3791 
3811-3816 3822-3824 3835 3867-3869 3871- 
3872 3882-3883 3897-3905 3937-3938 3941- 
3942 3959-3960 3962 3969-3971 4014-4015 
4023 4040-4042 4060-4061 4068-4069 4071- 
4075 4082-4084 4100 4104-4105 4114-4116
4121-4123 4134 4136 4142-4143 4227-4228 
4230-4231 4241-4244 4255-4257 4260 4272
4275-4277 4289-4295 4322 4335-4337 4379- 
4380 4399 4451-4452 4534 4564 4583-4586 
4598-4601 4613 4622-4623 4649 4651-4653 
4668-4674 4740-4741 4764-4766 4777-4780 
4787 4796-4797 4807-4808 4862-4863 4867 
4884 4918-4920 4953 4956-4957 4967-4970 
5034-5036 5046-5047 5085-5088 5092 5117- 
5118 5137-5140 5142-5143 5148-5149 5151- 
5152 5214-5221 5241-5242 5255 5261-5267 


5272-5274 5335-5343 5367-5368 5389-5391 
5427-5429 5431-5433 5438 5485-5488 5497 


lymphocytes


ATCC 


LPC001 


1-2 70 136 212-215 278-283 291 341-344 
388-390 450-451 462 551-553 558-560 602- 
603 607 641 652-653 675 743-744 755-757 
789-793 817 849-851 903-905 915 930 944 
951-953 992-993 995 999 1026-1027 1032- 
1034 1038-1040 1044-1047 1053-1054 1070- 
1071 1076 1089-1091 1117-1119 1139-1142
1275-1277 1290-1291 1300-1301 1317-1318 
1330-1331 1349-1351 1355 1357-1358 1374 
1397 1411 1438 1444 1473 1491-1494 1507 
1516-1517 1539 1547-1548 1551-1552 1575- 
1576 1617-1619 1634-1637 1648 1656 1674- 
1682 1686 1719-1721 1727 1737-1739 1742- 
1746 1753 1758 1780-1783 1807-1817 1830 
1835-1838 1870-1877 1915-1918 1925 1978- 
1979 2017-2020 2025-2026 2038 2045-2047 
2052-2053 2072-2074 2086-2087 2095 2098- 
2100 2111-2114 2116 2125-2133 2144 2146-
2147 2174-2176 2234-2242 2255-2257 2267- 
2270 2275 2294-2295 2298-2299 2303-2304 
2310 2341 2347-2348 2386 2428 2432 2440- 
2441 2456 2467-2468 2499 2560 2644 2650 
2665 2672-2674 2709-2711 2728-2729 2741- 
2743 2760 2763 2781 2785 2807 2857-2860
2873-2874 3009 3026-3027 3084-3085 3092 
3136-3137 3170-3171 3174 3199 3224-3225 
3258-3259 3269 3372-3373 3403-3404 3459- 
3460 3466-3468 3491-3495 3519 3536-3543 
3547 3554-3556 3561-3562 3588 3628-3630 
3664-3665 3691 3716 3791-3792 3807 3815- 
3816 3930 4068-4069 4118 4137 4228 4230- 
4235 4291-4295 4322 4341 4426 4498-4500 
4568 4597 4603 4643 4652-4653 4662 4678 
4740-4741 4796-4797 4907-4909 5085-5088 
5095-5097 5148-5149 5180 5188-5189 5268 
5448-5450 5467


leukocyte 


GIBCO 


LUC001 


1-2 12-22 39-40 45-47 70 78 107-110 135- 
136 154-156 165 175-177 212-215 223-226
231 258-260 267-268 278-283 286-287 313- 
315 324-325 332-333 341-344 364 379-380 
388-390 394-395 415-416 419 437-438 450-
451 456 462 465 478-480 491-492 511 520- 
524 531-532 538 551-553 557 561-562 574- 
577 583-590 597-598 602-603 612-620 633- 
634 641 652-653 668-669 677-685 699-701 
704-711 719-722 728-733 743-744 754-757 
760 763-771 784-786 794-796 811-816 822-
828 840-844 847 849-851 855-856 879-886 


889-897 903-908 910-914 919-920 924-927 
930-936 944-945 950 954-960 963 969 971- 
988 990-995 997-1009 1017-1021 1023-1027 
1036-1040 1042-1049 1053-1054 1063-1067 
1070-1071 1075-1076 1079-1080 1089-1091 
1095 1110-1113 1117-1121 1128-1136 1139- 
1142 1154-1155 1159 1173 1175 1182-1184 
1192-1193 1198-1212 1220-1221 1225-1228 
1230-1232 1235-1236 1266-1267 1271 1275- 
1277 1290-1291 1297-1301 1315-1319 1330-
1333 1349-1355 1357-1362 1374 1380 1383- 
1384 1389 1397-1399 1410-1411 1414-1417 
1422-1423 1432 1438 1440-1441 1444 1450- 
1470 1479-1485 1489 1491-1494 1497 1500 
1507-1512 1514 1516-1517 1521 1524 1532- 
1548 1550-1556 1567-1574 1582-1585 1588- 
1589 1591 1597 1603-1608 1611 1617-1619
1625 1629-1632 1634-1637 1639-1648 1654 
1656 1659-1664 1666-1673 1683-1686 1688- 
1689 1691-1692 1694-1698 1701-1702 1706- 
1716 1719-1721 1723 1727-1739 1743-1746 
1753-1756 1763-1771 1773-1783 1796-1798 
1807-1817 1827-1852 1857-1860 1870-1877 
1879-1885 1898 1900-1911 1913-1922 1925- 
1927 1950-1962 1964-1974 1978-1979 1992 
2000-2002 2004-2007 2010-2013 2017-2024 
2027-2030 2035 2038-2039 2042-2043 2045- 
2047 2050-2056 2058-2064 2075-2077 2080- 
2082 2086-2091 2098-2100 2111-2114 2116
2118 2121-2142 2144-2147 2153-2168 2171- 
2172 2174-2176 2179-2183 2186-2187 2191- 
2195 2208-2210 2216-2221 2223-2242 2251- 
2252 2254 2262 2265 2267-2270 2283 2288- 
2291 2310 2327-2331 2382 2386 2388-2391 
2414-2415 2418 2421 2423-2424 2427-2428 
2440-2441 2447 2451-2456 2465 2467-2474 
2483 2492-2494 2497 2499 2511-2513 2519-
2520 2528-2531 2534-2535 2548-2556 2560- 
2562 2564-2571 2573-2578 2587 2591 2594-
2595 2604 2626 2639-2642 2645-2647 2656 
2659-2662 2665 2672-2674 2702-2705 2719- 
2720 2728-2729 2741-2743 2747-2748 2754- 
2758 2760-2761 2774-2776 2781 2783 2803- 
2804 2809 2816-2820 2824-2826 2843 2852- 
2862 2865-2867 2871-2874 2888 2896-2897 
2901-2903 2914-2917 2925-2929 2945-2946 
2951-2952 2954 2961-2962 2965-2975 2990 
2996-2998 3008-3009 3013-3014 3023 3026- 
3027 3031-3032 3054 3080 3084-3085 3088 
3094-3095 3098-3099 3102-3105 3139-3149 


3158 3167-3168 3170-3173 3189-3191 3195 
3199-3202 3204 3206 3210-3211 3218 3220- 
3222 3224-3228 3230 3242-3243 3250-3252 
3257-3269 3276-3277 3280-3282 3285 3288- 
3290 3297-3299 3310-3312 3319-3322 3325- 
3327 3331-3340 3345-3346 3360 3394-3395 
3403-3404 3428-3429 3438-3442 3444-3449 
3456 3462 3466-3468 3486-3487 3491-3495 
3516-3517 3522 3536-3543 3548-3552 3558- 
3565 3576-3580 3583 3589 3591-3593 3599- 
3601 3605 3621-3625 3628-3630 3632-3635 
3658-3660 3668-3669 3680 3689 3691 3693 
3724-3725 3735 3737-3738 3747 3761 3778 
3780-3781 3783-3786 3791 3804-3807 3815- 
3816 3822-3824 3827 3837-3838 3845 3847- 
3849 3858-3859 3867-3869 3878-3879 3882- 
3883 3955-3960 3962 3966-3971 4023 4030- 
4031 4053-4056 4060-4061 4068-4069 4071- 
4076 4087-4089 4093 4104-4105 4114-4116
4126 4136-4137 4140-4144 4157-4158 4170- 
4178 4220-4223 4228 4230-4235 4241-4245 
4255-4257 4260 4273-4274 4285-4288 4291- 
4295 4322 4335-4337 4383-4384 4402 4404- 
4407 4430-4432 4534 4554-4555 4572 4577 
4583-4586 4598-4601 4604 4611 4613-4615 
4622-4626 4633-4634 4649 4651 4662 4668- 
4671 4740-4741 4748-4749 4756-4757 4764- 
4766 4772 4777-4780 4796-4797 4806 4832- 
4834 4862-4865 4900 4907-4909 4923-4925 
4948-4949 4953 4967-4970 4976-4979 5001- 
5003 5024 5038-5039 5082-5083 5092 5095- 
5097 5123-5124 5137-5140 5142-5143 5148- 
5149 5151-5152 5154-5157 5188-5192 5222- 
5224 5241-5242 5261-5267 5270 5272-5274 
5284-5289 5298-5302 5308-5311 5313-5315 
5329-5330 5335-5343 5374 5381-5382 5389- 
5391 5399-5401 5427-5429 5435-5437 5440- 
5442 5455 5460-5461 5464-5466 5497 


leukocyte 


Clontech 


LUC003 


1-2 48-50 154-156 195-196 286-287 313-315 
324-325 395 520-521 557 602-603 772 784- 
786 814-816 822-823 863-869 885-886 906- 
908 944 954-956 963 980-988 995 1050 1080 
1122 1129-1130 1182-1184 1192 1198-1201 
1317-1319 1348-1350 1353-1355 1357-1358 
1374 1432 1450 1507 1516-1517 1532-1535 
1547-1548 1664 1686 1715-1716 1737-1739 
1753 1814-1817 1857-1859 1888-1893 1903- 
1911 1919-1922 1950 1984 2010-2013 2035
2038 2054 2058-2061 2116 2118 2125-2133
2178 2191-2192 2223 2278 2572 2574-2575 


2645-2647 2735 2774-2776 2816-2817 3009 
3041-3049 3076-3078 3130 3156-3157 3167- 
3168 3170-3171 3289-3290 3341 3403-3404 
3635 3908 3941-3942 4030-4031 4055-4056 
4275-4277 4691-4692 4756-4757 4796-4797 
5001-5003 5148-5149 5275 5383 5435-5437 
5497 

melanoma 
from cell 
line ATCC 
#CRL 
1424 

Clontech 

MEL004 

1-2 52 96 138-139 278-283 313-315 479-480 
491-495 511 799-800 822-823 829 847 863-
869 871-875 889-897 944 951-953 957-962 
980-988 993 1017-1021 1038-1040 1042- 
1043 1129-1130 1172-1173 1182-1184 1202- 
1204 1220-1221 1237-1242 1269-1270 1290- 
1291 1337-1345 1359 1400-1401 1403-1404 
1432 1435-1436 1438 1442-1443 1457-1464 
1475-1477 1489 1505 1507 1524 1532 1536- 
1538 1547-1548 1551-1556 1575-1576 1585 
1603-1604 1611 1617-1619 1648 1663 1688- 
1689 1691-1692 1701-1702 1715-1716 1719- 
1721 1724 1735-1736 1746 1755-1756 1780- 
1783 1845-1847 1876-1877 1882-1885 1925 
1954-1962 1981-1983 2005 2045-2047 2058- 
2061 2088-2091 2115 2118 2138-2142 2144 
2178 2189-2190 2197-2199 2223 2254 2266 
2277 2281-2282 2284 2298-2299 2310 2347- 
2348 2389-2390 2418 2424 2427 2440-2441
2443 2510-2513 2548 2591 2597 2637 2659- 
2662 2781 2783 2814 2824-2826 2843-2845 
2857-2860 2898 2905 2945-2946 2955 2969- 
2973 3008 3029 3094-3095 3130 3166 3170- 
3173 3195-3196 3226-3228 3240 3258-3259 
3339-3340 3438-3441 3443 3459-3460 3574- 
3575 3577-3580 3589 3599-3601 3635 3658- 
3660 3691 3753 3815-3816 3828 3878-3879 
3941-3942 3966-3968 4077-4079 4104-4105 
4121-4122 4132-4133 4142-4144 4241-4242 
4275-4277 4287-4288 4326 4391-4393 4546- 
4548 4672-4674 4679 4737 4796-4797 4835 
4902 5055 5057-5065 5085-5088 5280 5308- 
5309 5389-5391 5421-5422 

mammary
gland

Invitrogen 

MMG001 

1-2 12-13 39-40 47 62 81-82 96 116-119 126 
173 175-177 180-182 195-196 213-215 227- 
230 236 246-247 258-260 274 278-283 313- 
315 321 341-344 346-349 354 365-367 399 
419-420 445-446 450-451 478 491-492 520- 
521 538 543 580-581 583-590 602-603 607 
629-630 647-649 652-653 670-671 677-678 
682-684 697 709-711 728-733 743-744 764-
771 789-793 796 801-803 806-808 814-816 
840-844 870 879-881 885-886 900-905 909- 


914 924-926 930 944 946-950 957-960 964- 
966 969 971-988 993 997-1002 1017-1021 
1023-1028 1032-1034 1038-1040 1042-1048 
1056-1057 1063-1068 1070-1071 1076 1088- 
1091 1108 1117-1119 1127 1131-1136 1139- 
1142 1151-1154 1159-1164 1175 1189-1191 
1193 1198-1201 1205-1207 1217-1218 1228 
1230-1232 1235-1242 1271 1297-1299 1307 
1315-1318 1320-1321 1323-1327 1337-1345 
1349-1350 1359-1362 1380 1385-1387 1389- 
1393 1400-1401 1403-1406 1408-1409 1411 
1414 1418-1421 1425 1427-1431 1435-1436 
1455-1462 1471-1472 1481-1484 1488 1491- 
1494 1498-1499 1507-1512 1524 1533-1535 
1540-1548 1550 1575-1576 1579 1585 1588- 
1589 1603-1608 1621-1623 1629-1632 1634- 
1637 1640-1647 1649-1653 1655 1665 1691- 
1692 1694-1698 1709 1715-1716 1727-1734 
1737-1739 1743-1746 1753 1758 1765-1771 
1773-1774 1780-1783 1835-1844 1848-1852 
1857-1860 1870-1875 1878-1885 1897 1900- 
1911 1919-1922 1925 1951 1954-1962 1964- 
1965 1978-1979 1981-1983 1990-1991 2010- 
2013 2025-2034 2036-2038 2045-2047 2050- 
2054 2060-2061 2068-2074 2076-2079 2095 
2101-2102 2111-2114 2117-2118 2143 2146- 
2147 2151-2152 2174-2176 2186-2187 2189- 
2190 2193-2201 2204-2210 2214-2221 2234-
2242 2254 2258-2260 2267-2270 2277 2281- 
2282 2284 2288-2291 2296-2297 2305-2306 
2310 2331 2333-2334 2338-2340 2345-2346 
2377-2381 2388 2391 2395 2397-2399 2405- 
2415 2428 2452-2454 2468 2473-2474 2483
2496 2511-2513 2526-2527 2536 2560 2574-
2575 2579-2584 2587 2592 2594-2595 2603- 
2604 2629-2631 2633-2636 2639-2642 2656- 
2662 2664-2665 2672-2676 2692 2696-2697 
2721 2728-2729 2734 2741-2743 2782 2787- 
2789 2811-2814 2828 2836-2839 2843 2855-
2860 2886-2887 2899-2900 2905 2925-2929 
2951-2952 2954 2961-2962 2976 2978 2981- 
2983 2985 2990 2996-2998 3008-3009 3013- 
3014 3030-3032 3118 3131 3136-3137 3149 
3156-3157 3172-3173 3189-3191 3195 3199 
3219 3226-3228 3243 3246 3250-3255 3257- 
3259 3297-3299 3310-3311 3313 3323-3325 
3331-3338 3396-3398 3403-3404 3411-3417 
3424-3427 3462 3466-3468 3472-3473 3498- 
3501 3516-3517 3536-3543 3550-3552 3554- 
3556 3561-3562 3577-3580 3624-3625 3638- 


3640 3658-3660 3664-3665 3667 3691 3713 
3729-3730 3760 3784-3788 3853 3871-3872 
3930 3939-3940 3955-3957 3997-3998 4004 
4014-4015 4033-4034 4036-4039 4043-4045 
4060-4061 4068-4069 4077-4079 4081-4084 
4093 4098 4112-4113 4118 4142-4143 4220-
4223 4239-4242 4261-4266 4275-4277 4283- 
4284 4291-4295 4297-4299 4330-4331 4335- 
4337 4379-4380 4391-4393 4403 4461-4464 
4531 4534 4549 4551 4558 4569-4571 4583- 
4586 4598-4601 4629-4634 4642 4651 4672- 
4674 4691-4692 4796-4797 4861-4865 4967- 
4970 4976-4979 5034-5036 5057-5065 5092 
5095-5097 5116 5137-5140 5148-5149 5190- 
5192 5222-5224 5241-5242 5261-5267 5272- 
5274 5276-5277 5313-5315 5329-5330 5335- 
5343 5387-5388 5408-5410 5423-5425 5439 
5455 5485 5497 

induced
neuron
cells

Stratagene

NTD001 

313-315 548-553 682-684 814-816 903-905 
909 944 950 969 993 1017-1021 1026-1027 
1036-1037 1070-1071 1088 1285 1294-1296
1315-1316 1322 1357-1358 1363-1371 1403- 
1404 1520 1536-1538 1620 1629-1632 1638 
1663 1701-1702 1707-1708 1724 1753 1770- 
1771 1915-1918 1925-1927 1950 1993-1998 
2017-2020 2025-2026 2058-2061 2083 2121- 
2124 2144 2151-2152 2197-2199 2331 2386 
2469 2573 2596 2628 2752-2753 2843 2898 
2925-2929 2961-2962 2969-2973 3172-3173 
3189-3191 3224-3225 3253-3255 3310-3311 
3428-3429 3470-3471 3673-3677 3760 3969- 
3971 4014-4015 4082-4084 4090-4092 4100 
4114-4116 4140-4141 4272 4285-4286 4470
4616-4621 4691-4692 4761-4763 4864-4865 
4907-4909 5137-5140 5298-5302 

retinoic
acid
induced
neuronal
cells

Stratagene

NTR001 

195-196 278-283 388-390 743-744 855-856 
995 1038-1040 1139-1142 1418-1420 1533- 
1535 1780-1783 1903-1911 2060-2061 2223 
2592 3289-3290 3969-3971 4598-4601 

neuronal 
cells 

Stratagene 

NTU001 

74-77 195-196 246-247 278-283 294-295 
341-344 388-390 491-492 566 652-653 680- 
681 743-744 755-757 784-786 801-803 855- 
856 863-869 900-902 919-920 950 964-966 
995 997-998 1000-1002 1076 1159 1235- 
1236 1294-1297 1432 1451-1454 1507 1533- 
1535 1605-1608 1648 1667-1670 1688-1689 
1691-1692 1694-1698 1703-1704 1746 1753 
1765-1769 1831-1834 1848-1851 1900-1902 
1925 1966-1974 2060-2061 2088-2091 2095 


2144 2223 2254 2276 2310 2573 2598 2628 
2677-2680 2752-2753 2824-2826 2913 3149 
3229 3486-3487 3628 3638-3640 3864-3865 
3969-3971 4215-4217 4285-4286 4583-4586 
4910 5001-5003 5137-5140 5211-5213 5423- 
5425 

pituitary 
gland 

Clontech 

PIT004 

175-177 227-230 491-492 796 822-823 849- 
851 992 995 1017-1021 1042-1043 1160- 
1164 1182-1184 1202-1204 1215-1216 1220- 
1221 1300-1301 1317-1318 1320-1321 1398- 
1399 1410 1479 1507 1540 1553-1556 1582- 
1585 1591 1625 1648 1657-1658 1663 1708 
1753 1870-1875 1925 2060-2061 2068-2071 
2118 2447 2604 2696-2697 2715 2774-2776 
2843 2871-2872 3021-3022 3100 3203 3331- 
3332 3339-3340 3424-3427 3577-3580 3684 
3787-3788 3959-3960 4549 5431-5433 

placenta 

Clontech 

PLA003 

1052 1215-1216 1694-1698 1919-1922 2116 
3969-3971 4672-4674 5001-5003 5241-5242

prostate 

Clontech 

PRT001 

42 165 246-247 335 511 548-550 675 847
873-875 879-881 889-897 910-914 946-949
980-988 993 995 1008-1009 1038-1040 1049 
1128 1154 1157 1173 1182-1184 1202-1204 
1297 1317-1318 1352 1357-1359 1398-1399 
1414 1457-1462 1485 1498-1499 1524 1553- 
1556 1629-1632 1648 1683-1684 1688-1689 
1718-1721 1746 1753 1770-1771 1792-1795 
1831-1834 1860 1870-1875 1879-1881 1925- 
1927 1990-1991 2005 2035 2038 2045-2047 
2055-2056 2060-2064 2083 2088-2091 2118 
2144 2179-2181 2202-2203 2229 2254 2277 
2283 2296-2297 2303-2304 2315 2337-2340 
2387 2418 2423 2427 2445 2456 2468 2475 
2492-2495 2510 2536 2543-2544 2546 2549-
2550 2574-2575 2591-2592 2604 2645-2647 
2649 2659-2662 2712 2721 2749-2750 2760 
2857-2860 2871-2872 2894-2895 2909-2911
2925-2929 2951-2952 3005 3013-3014 3017- 
3020 3029 3081 3100 3121 3148 3174 3199 
3226-3228 3242 3250-3252 3276-3277 3280- 
3282 3558-3560 3606-3607 3694-3696 3765 
3891 3962 4067 4101-4102 4232-4235 4383- 
4384 4461-4464 4533 4546-4548 4581-4582 
4796-4797 4882 4886 4899 4907-4910 4967- 
4970 5073 5078 5295-5296 5431-5433 

rectum 

Invitrogen 

REC001 

39-40 61 64 278-283 298-300 491-492 561- 
562 652-653 789-793 870 879-881 957-960 
997-998 1007 1042-1043 1070-1071 1131- 
1136 1159 1217-1218 1235-1242 1272-1273 
1275-1277 1317-1319 1363-1371 1380 1386- 


1387 1444 1446-1449 1507 1511-1512 1532
1540 1547-1548 1553-1556 1667-1673 1691-
1692 1694-1698 1715-1716 1728-1736 1848-
1851 1860 1870-1875 1903-1911 1915-1918
1948-1949 1981-1983 2005 2038 2054-2056
2076-2077 2098-2100 2153-2155 2191-2192
2200-2201 2204-2207 2315 2317 2380-2381
2526-2527 2553 2576-2578 2587 2596 2629-
2631 2675-2676 2696-2697 2702-2705 2716-
2718 2844-2845 2865 2931-2939 3008 3035-
3036 3213-3214 3220-3222 3250-3252 3310-
3311 3336-3338 3462 3516-3517 3536-3545
3590-3593 3619-3620 3624-3625 3629-3630
3793 3815-3816 3969-3971 4071-4076 4114-
4116 4404-4407 4668-4671 4862-4863 4877
4903 4910 4946 5001-5003 5034-5036 5095-
5097 5120 5272-5274 5322 5399-5401 5427-
5429 5431-5433


salivary 
gland 


Clontech 


SAL001 


48-50 116-119 154-156 175-177 313-315 
396-398 491-492 543 591 784-786 826-828 
910-914 924 950 994-995 999 1023-1025 
1036-1037 1049 1128 1202-1204 1230-1232 
1237-1242 1297 1322 1332-1333 1352 1418- 
1420 1446-1449 1473 1480-1484 1498-1499 
1507 1511-1512 1533-1535 1541-1546 1667- 
1670 1686 1746 1763-1769 1792-1795 1839- 
1844 1857-1859 1865-1875 1882-1885 1919- 
1922 1948-1949 1951 1978-1979 2017-2020 
2055-2056 2118 2125-2130 2138-2142 2146-
2147 2179-2181 2251-2252 2255-2257 2273 
2280 2286-2287 2395 2403 2405-2413 2423 
2499 2536 2591 2629-2631 2700 2712 2781 
2784 2843-2845 2855-2856 2898 2965-2968 
3008 3021-3022 3075 3236 3280-3282 3319- 
3322 3462 3491-3495 3632-3634 3778 3867- 
3869 3966-3968 4291-4295 4333 4581-4582 
4598-4601 4681-4683 4729 4953 5001-5003 
5148-5149 5270 5272-5274 5406-5407 5464- 
5466 


salivary 
gland 


Clontech 


SALs03 


341-344 1089-1091 1435-1436 1511-1512 
1664 1708 4907-4909 5272-5274 


skin 

fibroblast 


ATCC 


SFB001 


491-492 1089-1091 1182-1184 1685 2005 
2223 5423-5425 


skin 

fibroblast 


ATCC 


SFB002 


175-177 1089-1091 1182-1184 1688-1689 
1763-1764 3289-3290 5423-5425 


skin 

fibroblast 


ATCC 


SFB003 


366-367 840-844 1089-1091 1557 1688-1689 
2005 3313 5423-5425 


small 
intestine 


Clontech 


SIN001 


154-156 179 191 201-205 212 277 341-344 
357-361 435 457 652-653 698 873-875 944 
969 997-999 1032-1034 1048 1063-1067 


1078 1128 1131-1136 1139-1142 1154 1182- 
1184 1323-1327 1359 1451-1454 1511-1512 
1540 1550-1552 1626 1634-1637 1640-1645 
1655 1663 1688-1689 1703-1704 1710-1714 
1805-1806 1827-1829 1839-1844 1964-1965 
1981-1983 2004-2005 2055-2056 2068-2071 
2086-2087 2096-2097 2156-2160 2234-2242 
2311-2313 2321 2338-2340 2385 2510 2536
2603-2604 2628 2665 2748 2840-2841 2843 
2951-2952 2954 2964 2969-2973 3100 3151- 
3152 3181-3183 3193 3309-3311 3331-3332
3403-3404 3470-3471 3590 3689 3729-3730 
3810 3878-3879 3893 3941-3942 3959-3960 
3973-3983 4004 4062-4064 4239-4240 4382 
4560 4668-4671 4796-4797 4864-4865 4890- 
4891 4910 5001-5003 5057-5065 5080 5148- 
5149 5272-5274 5335-5343 5399-5401 5435- 
5437 5444-5447 

skeletal 
muscle 

Clontech 

SKM001 

1-2 154-156 175-177 216-218 245 313-315 
346-349 354 574-577 849-851 928-929 957- 
960 971-974 1003-1006 1032-1034 1076 
1300-1301 1334-1335 1395-1396 1403-1404 
1432 1550 1691-1692 1735-1739 1746 1830 
2049 2118 2548 2560 2592 2629-2631 2746 
2785 2819 2843 3121 3181-3183 3310-3311
3432 3663 3737-3738 3943-3948 4534 5095- 
5097 

skeletal 
muscle 

Clontech 

SKM002 

1688-1689 2234-2242 2288-2291 3795-3800 
5423-5425 

skeletal 
muscle 

Clontech 

SKMs03 

1688-1689 3795-3800 

skeletal 
muscle 

Clontech 

SKM0s4 

1585 3536-3543 3795-3800 5154-5157 

spinal 
cord 

Clontech 

SPC001 

30-31 74-77 123 134 154-156 175-177 213- 
215 301-303 313-315 421 491-492 520-521 
751 796 822-823 849-851 855-856 863-869 
871-872 889-897 909 924 927 950-953 964- 
966 980-988 997-998 1017-1021 1026-1027 
1049 1053-1054 1089-1091 1127 1151-1154 
1159 1173 1175 1182-1184 1189-1191 1215- 
1216 1220-1221 1230-1232 1319-1321 1349- 
1350 1355 1359 1363-1371 1388-1393 1398- 
1399 1422-1423 1432 1446-1449 1470 1532 
1539 1541-1546 1551-1552 1569-1574 1582- 
1584 1649-1653 1663 1685 1688-1689 1707 
1715-1716 1727 1735-1736 1792-1798 1831- 
1834 1839-1844 1915-1922 1951 1992 2005 
2017-2020 2035 2042-2043 2057 2060-2061 
2072-2074 2086-2087 2096-2097 2118 2143-
2144 2173 2186-2187 2193-2195 2208-2210 


2223 2230 2233-2242 2246-2248 2273 2278 
2380-2381 2419-2420 2488 2511-2513 2536 
2594-2595 2650 2659-2662 2702-2705 2747 
2749-2750 2759 2761 2774-2776 2820 2843- 
2845 2865 2899-2900 2954 2960-2962 2974- 
2975 2992 2996-2998 3021-3022 3203 3289- 
3290 3312 3342-3346 3402 3428-3429 3536- 
3543 3550-3552 3561-3562 3574-3575 3588 
3636 3729-3730 3792 3804-3806 3810 3831- 
3833 3840-3841 3855 3882-3883 4036-4039 
4048-4051 4093 4104-4105 4114-4116 4134 
4145-4153 4220-4223 4227 4232-4235 4272 
4373 4381 4409 4534 4546-4548 4629-4632 
4738-4739 4796-4797 4862-4865 4907-4909 
5001-5003 5057-5065 5085-5088 5137-5140 
5188-5192 5276-5279 5421-5425 5427-5429 
5445-5447 5464-5466 5489 

adult 
spleen 

Clontech 

SPLc01

491-492 548-550 561-562 641 652-653 801- 
803 863-869 944 969 971-974 995 1128 1171
1205-1207 1271 1290-1291 1330-1331 1353- 
1354 1508-1510 1547-1548 1551-1552 1605- 
1608 1621-1623 1625 1694-1698 1743-1745 
1753 1796-1798 1827-1829 1848-1851 2054 
2066-2067 2098-2100 2111-2114 2179-2181
2193-2195 2537-2538 2604 2925-2929 3017- 
3020 3234 3240 3250-3252 3289-3290 3402
3536-3543 3667 3975-3983 4114-4116 4136 
4549 4652-4653 4691-4692 4796-4797 4907- 
4909 5001-5003 5050-5052 5144 5241-5242 
5270 5335-5343 5346-5354 5389-5391 

stomach 

Clontech 

STO001 

47 134 154-156 286-287 394 440-441 468
707 754 950-953 961-962 995 1041 1050 
1070-1071 1075 1160-1164 1182-1185 1195- 
1196 1256 1414 1507 1511-1512 1524 1638 
1648 1664 1674-1682 1687 1724 1746 1780- 
1783 1819 1952-1953 2093 2118 2121-2124 
2188 2216-2221 2234-2242 2251-2252 2258- 
2260 2273 2424 2464 2511-2513 2522 2548
2626 2645-2647 2650 2664 2675-2676 2686 
2726 2820 2842 2898 2957 3008 3114 3172-
3173 3197 3258-3259 3285 3310-3311 3374-
3382 3428-3429 3456 3508-3513 3584-3585 
3693 3882-3883 3906 3969-3971 4241-4242 
4400 4498-4500 4672-4674 4910 4967-4970 
5431-5433 5497 

thalamus 

Clontech 

THA002 

14-22 52 70 96 131-132 154-156 235 296 
313-315 354 400-404 436 551-553 709-711 
822-823 829 964-966 969 997-998 1026-1027 
1038-1040 1044-1047 1051 1129-1130 1154 
1175 1182-1184 1193 1244-1246 1249-1255 


1271 1281-1282 1290-1291 1320-1321 1348 
1360-1362 1390-1394 1407 1427-1431 1470 
1507 1540 1585 1592 1620-1623 1640-1645 
1683-1684 1694-1698 1709 1735-1736 1839- 
1844 1870-1875 1897 1925 1948-1949 2010-
2013 2025-2026 2031-2034 2072-2074 2076- 
2077 2202-2203 2211-2213 2255-2257 2261
2298-2299 2322 2349-2356 2419-2420 2437 
2470-2472 2603 2637 2650 2702-2705 2754- 
2758 2765 2784 2857-2860 2951-2952 3025 
3139-3147 3167-3168 3230 3237 3246 3288 
3358 3424-3427 3477-3478 3516-3517 3554- 
3556 3729-3730 3769 3831-3833 3877 3891 
3969-3971 4093 4098 4127-4130 4220-4223 
4283-4284 4291-4295 4581-4582 4598-4601 
4672-4674 4738-4739 5073 5188-5189 5202- 
5207 5399-5401 5423-5425 5440-5441 

thymus 

Clontech 

THM001 

28 39-40 42 52 125 137 157-159 165 175-177
198 235 274 277 284 366-367 394 450-451 
491-492 499 516 583-590 605-606 659-660 
707-711 764-771 822-823 840-844 847 852-
854 863-869 899 944 950-953 980-988 997- 
999 1017-1021 1026-1027 1075-1076 1080 
1131-1136 1139-1142 1173-1174 1182-1184
1202-1204 1230-1232 1290-1291 1308-1309 
1359 1380 1389 1397 1410 1414 1418-1423 
1434 1444 1450 1470 1479 1485 1507 1511-
1512 1516-1517 1524 1551-1557 1569-1574 
1597 1611 1617-1619 1659-1660 1663 1686
1709-1714 1719-1721 1727 1746 1753 1763- 
1764 1792-1795 1827-1829 1857-1859 1876-
1877 1879-1881 1915-1922 1926-1927 1954- 
1962 2000-2002 2031-2034 2038 2049 2054 
2060-2061 2098-2100 2118 2125-2133 2138- 
2142 2145 2148-2150 2153-2160 2191-2192 
2214-2215 2246-2248 2254-2257 2267-2270 
2273 2280 2284 2298-2299 2301 2307 2338- 
2340 2427 2456 2468 2490-2491 2536 2542 
2561-2562 2604 2730 2739 2752-2758 2820 
2843 2866-2867 2873-2874 2913-2917 2919- 
2920 2954 2974-2975 3009 3025 3035-3036 
3088 3094-3095 3117 3149 3170-3171 3210- 
3211 3226-3229 3235 3238 3250-3255 3283 
3289-3290 3314 3342-3344 3428-3429 3508- 
3513 3591-3593 3605 3608 3624-3625 3632- 
3634 3636 3689 3691 3723 3772 3778 3780- 
3781 3784-3786 3815-3816 3864-3865 3882- 
3883 3891 3897-3905 3925-3926 3958 3962 
4093 4100 4112-4116 4126-4130 4228 4287-
4288 4581-4582 4598-4601 4652-4653 4662 






4796-4797 4839 4910 5000-5003 5137-5140 
5148-5149 5190-5192 5272-5274 5317-5320 
5384 5483 

thymus 

Clontech 

THMc02 

39-40 52 74-77 92-96 136 154-159 168 175- 
177 244 258-260 301-303 316-320 365-367 
400-404 462 471 491-492 498 516 522-524 
531-532 551-553 557 602-603 607 647-649 
670-671 697 699-701 709-711 728-733 784- 
786 822-823 829 833 840-844 863-869 885- 
886 925-926 931-936 944 950 971-974 993 
995 997-999 1003-1006 1017-1021 1042- 
1047 1070-1071 1075 1110-1113 1128 1131- 
1136 1171 1182-1184 1192 1202-1207 1271 
1275-1277 1315-1319 1322 1332-1333 1357- 
1359 1363-1371 1389 1398-1401 1405-1407 
1432 1440-1441 1446-1449 1455-1462 1467- 
1469 1479 1507-1510 1524 1526 1533-1535 
1540 1551-1552 1569-1574 1588-1589 1617- 
1619 1634-1637 1646-1647 1656 1694-1698 
1701-1702 1707 1715-1716 1727 1743-1746 
1754 1763-1764 1792-1795 1831-1834 1839- 
1844 1848-1851 1857-1860 1870-1877 1879- 
1881 1903-1911 1913-1918 1952-1962 1966- 
1974 1981-1983 2010-2013 2017-2024 2048 
2052-2053 2060-2061 2072-2074 2080-2082 
2086-2087 2098-2100 2131-2133 2138-2142 
2148-2150 2153-2160 2178 2191-2192 2196 
2208-2210 2214-2221 2230 2234-2242 2249 
2286-2287 2331 2338-2340 2360 2388 2391 
2464 2511-2513 2519-2520 2537-2538 2604
2645-2647 2651-2655 2657-2658 2672-2674 
2677-2680 2737-2738 2741-2743 2781 2829 
2846-2847 2896-2897 2901-2903 2918 2976 
3009 3068 3124-3128 3138 3196 3215-3216 
3220-3222 3230 3240 3250-3252 3274 3289- 
3290 3299 3310-3311 3331-3332 3394-3395
3403-3404 3406-3407 3459-3460 3466-3468 
3535-3543 3554-3556 3591-3593 3654-3657
3729-3730 3737-3738 3768-3769 3795-3800 
3817-3821 3846 3867-3872 3878-3879 3882- 
3883 3925-3926 3969-3971 3975-3983 4100 
4106 4285-4288 4291-4296 4326 4343-4347 
4360 4376-4377 4439 4529 4534 4542-4544 
4581-4582 4598-4601 4613-4615 4622-4623 
4629-4632 4651 4657 4660 4672-4674 4729
4747-4749 4796-4797 4864-4865 4903 4907- 
4909 5001-5003 5046-5047 5130-5131 5148- 
5149 5210 5241-5242 5261-5267 5276-5277 
5298-5302 5313-5315 5322 5329-5330 5332- 
5333 5335-5343 5346-5354 5421-5425 5440- 






5442 

thyroid 
gland 

Clontech 

THR001 

1-2 47 62 70 74-78 100-106 134 136 138-139 
154-156 175-177 185 191 197 222 231 237- 
240 252-253 278-283 313-315 332-333 341- 
344 357-361 365 379-380 394 400-404 415- 
416 419 437-438 463 491-492 511 513 574-
577 583-590 631 652-653 670-671 685 699- 
701 704-707 728-733 796 822-823 840-844 
847 863-870 889-898 903-908 910-914 916- 
918 927-929 931-936 944 951-953 969 971- 
974 980-988 992-995 997-999 1003-1006 
1008-1009 1017-1021 1032-1034 1036-1037 
1049 1052-1054 1056-1057 1063-1067 1070- 
1071 1075 1079 1110-1113 1117-1121 1128- 
1136 1154 1172-1173 1175 1180-1187 1198- 
1204 1217-1218 1220-1223 1228 1235-1236 
1243-1246 1249-1255 1266-1267 1269-1271 
1275-1277 1286 1297 1300-1301 1307 1310- 
1319 1323-1327 1332-1333 1349-1350 1353- 
1355 1359-1362 1374 1386-1387 1389-1393 
1395-1399 1403-1404 1412 1414-1420 1427- 
1431 1438 1440-1444 1446-1449 1455-1456 
1463-1464 1470 1473 1479-1480 1488 1507- 
1510 1520 1524 1536-1538 1547-1548 1551- 
1552 1558 1569-1574 1582-1584 1586-1589 
1611-1612 1617-1620 1639-1645 1648 1657- 
1658 1663-1665 1667-1670 1683-1684 1686 
1691-1692 1701-1702 1707 1715-1716 1723 
1735-1739 1746 1753 1755-1756 1765-1771 
1773-1774 1780-1783 1792-1798 1805-1813 
1827-1834 1839-1844 1848-1852 1870-1877 
1897 1903-1911 1915-1918 1925-1927 1951
1954-1962 1964-1974 1999-2003 2005 2010- 
2013 2017-2020 2025-2026 2036-2038 2042-
2043 2045-2048 2050-2059 2062-2064 2066- 
2071 2075 2083 2086-2091 2093 2101-2102 
2111-2114 2116 2118 2125-2133 2143-2144
2156-2160 2163-2168 2173-2176 2179-2181 
2186-2187 2200-2210 2223 2230 2253-2260 
2262 2267-2270 2273 2288-2292 2296-2297 
2303-2304 2327-2331 2358 2377-2379 2386 
2418 2421 2423 2427 2434-2435 2444 2449 
2452-2454 2467 2496 2502 2510-2513 2534- 
2536 2549-2550 2554-2556 2564-2571 2573- 
2575 2598 2604 2626 2629-2631 2645-2648
2650-2655 2657-2662 2672-2676 2686 2700 
2702-2706 2709-2711 2726 2741-2743 2746-
2748 2760-2761 2763 2772 2777-2778 2805- 
2806 2813-2814 2818 2828 2833 2843 2852- 
2853 2861-2862 2866-2867 2898-2900 2905 




2913 2925-2929 2945-2946 2965-2973 2992 
3008 3010-3012 3021-3022 3024 3084-3085 
3088 3094-3095 3123 3131 3133-3135 3138 
3153-3158 3170-3171 3189-3191 3195 3210- 
3212 3218 3220-3222 3226-3228 3240 3242- 
3243 3256 3258-3259 3279-3282 3288 3297- 
3299 3313 3319-3322 3325 3331-3335 3342- 
3346 3372-3382 3399 3408 3418 3424-3429 
3438-3441 3444-3445 3456 3466-3468 3474 
3477-3478 3516-3517 3522 3524-3532 3535 
3544-3545 3554-3556 3558-3562 3577-3580 
3583 3586 3589 3591-3593 3602-3605 3610- 
3613 3628 3638-3640 3658-3660 3673-3677 
3680 3685 3691 3693 3708 3724-3725 3747 
3762 3791-3792 3804-3807 3815-3816 3822- 
3824 3867-3869 3871-3872 3886 3891 3895 
3908 3930 3949-3951 3962 3966-3971 4004- 
4007 4014-4015 4024-4025 4033-4034 4043- 
4045 4093 4100 4104-4105 4109-4111 4123 
4126 4140-4141 4169 4220-4223 4230-4235 
4241-4244 4275-4277 4379-4380 4383-4385 
4435-4437 4461-4464 4520 4522 4537-4544 
4568 4581-4582 4598-4601 4633-4635 4640 
4681-4683 4691-4692 4764-4766 4785 4796- 
4797 4864-4865 4873 4890-4891 4907-4910
4980 5085-5088 5092 5107-5108 5147-5149 
5154-5157 5241-5242 5280 5308-5309 5329- 
5330 5335-5343 5369 5389-5391 5399-5401 
5406-5407 5423-5425 5427-5429 5442 5448- 
5450 5464-5466 5497 


trachea 


Clontech 


TRC001 


1-2 39-40 52 231 288-290 306 379-380 511
822-823 889-897 909 951-953 963 990-991 
1026-1027 1052 1110-1113 1129-1130 1182- 
1184 1272-1273 1292-1293 1297 1300-1301 
1307 1349-1350 1352 1363-1371 1397 1440- 
1441 1457-1462 1511-1512 1532 1547-1548 
1586-1587 1612 1648 1664 1667-1670 1687 
1690 1708 1735-1736 1746 1770-1771 1876- 
1877 1900-1902 1948-1949 1951-1953 2000- 
2002 2004 2021-2024 2036-2037 2054-2056 
2060-2064 2118 2422 2452-2454 2470-2474 
2511-2513 2604 2659-2662 2681-2685 2748 
2879-2881 2898 2925-2929 2974-2975 3026- 
3027 3170-3171 3223 3242 3260-3267 3394- 
3395 3446-3449 3456 3663 3673-3677 3686- 
3688 3761 3969-3971 4014-4015 4140-4141 
4275-4277 4477 4554-4555 4570-4571 4664- 
4665 4761-4763 4864-4865 4878-4879 4892 
5241-5242 5272-5274 5438 


uterus 


Clontech 


UTR001 


116-119 137-139 278-283 313-315 379-380 






491-492 548-550 583-590 592-594 789-793 
814-816 822-823 930 995 999 1050 1068 
1143 1202-1207 1230-1232 1297 1323-1327 
1351 1363-1371 1383-1384 1388 1425 1438 
1451-1454 1507 1551-1552 1582-1584 1627 
1663 1688-1689 1691-1692 1719-1721 1746 
1753 1755-1756 1765-1769 1792-1795 1839- 
1844 1878 1919-1922 1951 1988 2017-2024 
2045-2047 2055-2056 2118 2193-2195 2208-
2210 2254 2273 2296-2297 2444 2469 2552 
2604 2665 2696-2697 2768-2771 2781 2802 
2861-2862 2955 3156-3157 3419 3451-3455 
3577-3580 3708 3729-3730 3749-3752 3880 
3934 3966-3968 4043-4045 4062-4064 4239- 
4240 4374-4375 4629-4632 4666 4796-4797 
5024 5148-5149 5181-5183 5389-5391 5485 


*The 16 tissue mRNAs and their vendor sources are as follows: 1) normal adult brain mRNA
(Invitrogen), 2) normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen),
4) normal fetal brain mRNA (Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal
fetal liver mRNA (Invitrogen), 7) normal fetal skin mRNA (Invitrogen), 8) human adrenal gland
mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) human leukemia lymphoblastic
mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lymph node mRNA
(Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15)
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain).
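
The SEQ ID NOS column of the table above uses a compact notation in which a hyphen joins the first and last members of a run of consecutive identifiers (for example, 14-22), and a range may wrap across a line break. A minimal sketch of how such a listing could be expanded into individual identifiers is given below; it assumes whitespace-separated tokens, and the function name `expand_seq_ids` is hypothetical and introduced only for illustration.

```python
import re
from typing import List


def expand_seq_ids(listing: str) -> List[int]:
    """Expand a listing such as '14-22 52 70' into individual integers.

    Assumption: tokens are separated by whitespace, and a range that wraps
    across a line break appears as '1195-' followed by '1196' on the next line.
    """
    # Rejoin ranges that were split after the hyphen by a line break.
    joined = re.sub(r"-\s+", "-", listing)
    ids: List[int] = []
    for token in joined.split():
        if "-" in token:
            first, last = token.split("-")
            ids.extend(range(int(first), int(last) + 1))
        else:
            ids.append(int(token))
    return ids


# Example using entries copied from the listings above.
print(expand_seq_ids("14-22 52 70"))
print(expand_seq_ids("1195-\n1196 1256"))
```

Expanding the listings in this way makes it straightforward to check, for a given SEQ ID NO, which libraries it was found in.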




Table 2 


SEQ ID NO: of nucleotide sequence

SEQ ID NO: of peptide sequence

Method

SEQ ID NO: in USSN 09/770,160

Nucleotide location of first codon for peptide sequence

Nucleotide location of last codon for last amino acid of peptide sequence

Amino acid sequence (X=Unknown; *=Stop codon; /=possible nucleotide deletion; \=possible nucleotide insertion)

1 

5498 

C 

1 

239 

322 

MGGALLKEP1LSPGGGKGKIFFWGP 
QN* 

2 

5499 

A 

2 

1441 

2129 

SV1A*SCRASVASKQS*PTLLPSACA 

RPHA\STVDAPASGGAPRASSP\SSD 

CLWSTSSSSTPLSASASSS/SPPSFNP 

AADARGSQGPGARGRSCSPSSSERH

VRRRVSAARQAGAASAGGGRQAG 

LAGRSGLSA/SRSSARASSSATPALA 

QST\PSSESECAPLKSRSGLTSSLSKP 

AS*ATLGKKGSGSSWRFPPES1HGR 

HPLSASCWNKSVAAAAAPTGATAP 

PKAGP 

3 

5500 

C 

3 

36 

236 

MGPTIPDXSXFFWRKPITWMPTWE 

GTSNVGPQPLSSSKSLHSXRGHPAPI 

PTGQAGPRDSGPGASP* 

4 

5501 

A 

4 

109 

300 

GGGKQIPFKGGKFKWGPGPVLKKG 
EREKPGGNPKKTPWKKASSRPAPRI 
HPCFT*HAPDPRPLY 

5 

5502 

A 

5 

2 

73 


6 

5503 

A 

6 

27 

375 

EHSGVRQALCFGTASQRPSQQPAPS 
GPGPPGEPG*ERLCASHKAFISHKQS
H*SPQ*PCQAGVTLSRLQTTNSPRPH 
SQKGLRGPRTQTLSLTSQPTACSEN 
SQGSQPSPKRTLS 

7 

5504 

B 

7 

50

204 

XKEGSLCDEYWNPAANLINVCSLFL 
RQGPRLALMQGEPVDKGCLGVLLE 
NK* 

8 

5505 

A 

8 

379 

623 

ATTVSVFPFTAKLLERPGLHLLVFLP 
NLQFPLQPLVS*LALLRGSTLTKQV 
PSAPDKPLLVSPSPAKHPPVPPSCGP 
GLQG 

9 

5506 

B 

9 

185 

366 

XHPGDGFRPNQEGDERPARKKTWV 

RDGGPHQGLFRSFHPQFFSRPSRAT 

AHVPAVYFSVEWX* 

10 

5507 

A 

10 

29 

308 

WLPPNPGRRREARQEEDLGPGWW 
APSGPLPQLPSAVLQPTQPGHGPRA 
SL**SVCFSFADKEGSLCDEYWNPA
A/KPH*RLQPLPSTRPEISPL 

11 

5508 

A 

11 

663 

1269 

TAGTWAVASLGRLKNCGWKLRKE 

ALMGPTIPDPKSSPLAGLSSPFPWFG 

RKPITLECPTWERDPRNVGPPAPSP 

ARKSLPQPTGTTLQPYSPRDKAGPK 

KTLGPRG/APL*VRRTRPLN*WTPA 

DLGVRTRGAGPLPDPAGTLRPRGA 

VEPSVSACGKWAPSPTSQGCCEGR 

CDAVPKHEGLAHPTvLSINVFPVLN 

QKKKKK 

12 

5509 

A 

12 

190 

715 


13 

5510 

A 

13 

270 

713 

KLTILDCQFTG*QR*KFNG*NLRNR/ 
HSPSRWDGAKPLYKALKL*SSSSSV
GAFIFIFTRSRLRAYLFSFAH/LRRPL 
LAGHLLCSPEQAVELSALLAQTKFG 
DYNQNTAKYNYEELCAKELSSATL 









NSIVAKHKELEGTSQASAEYQVL 

14 

5511 

A 

14 

1575 

1968 

REMGFRHVGQTGLELLTSGDLPTSA 

SQSAGITGVSHHTWPKTLFVLRQSL 

TLSPGLECSGTISAHCSPHLPCSSNS 

CAPASRVAESTEAHH/LCPDNLHISS 

REGASPCWPGCS*TPELKRPAHPCR 

DQLGH 

15 

5512 

A 

15 

185 

720 

KVSHVYLFLHRHGNHPISQTFPHLS 

PLSIPQNCHCHHGPFSMSCWRIKYL 

GIQLTRDVKDLFKEN*KPLLSKIKED

TNKWKNIPCSW1GRINIVKMAILP/K 

ELEKTTLKFIWNQKRACIAKTILSQ 

KNKAGDITLPDFKLYYKATVTKTA

WEVQNRDIDQWNRIEPSEITPHIYNY 

LIF 

16 

5513 

A 

16 

1114 

2193 

GSFTKRVRRAFKVLRDNPTVAKLS 

QVKKHWYFTWNHKRLKIAKAILSK

KNKPGGITLPDFKL*YRATVSKTVW 

YWHKNRHINQWNRIRNPEANAHTY 

I*LIFDKGAKNIHWVKTSLFNKWCW 

EN*ISIC\KEWEKISANYPSDKGLITR 

IYKEL/K/QL*EKKSNNLIKKQAKDL 

NRHFSKEDK*MANRHMKKCSMLIT 

REMQIKTTMKYHFTPVKMVYIQKA 

GNDKCWQGCGEKGTFVHC*WECK

LV*PL*RTVWRELEKL/E/LELP*DPA 

IPLLGIYPK*RKS/CVIKE1TVAKIWK 

QPKCPSTDKWIKKMWYIYTMDYYS 

ALKKNEILSFPTTWMELK1VILSVIG 

QSQKDKHCMFSLICGS 

17 

5514 

A 

17 

149 

328 

WQDPLQDPCCHQPFHLCLRR*TLH* 
LRQQ*WPLLRQLRGKIMLILLNTHP 
EHPCVLLDL 

18 

5515 

A 

18 

615 

734 

ENSCWTATLQMGKNWQSL*PVLTS
YYR*DNSYWREILQV 

19 

5516 

A 

19 

1 

181 

MRARRLPWALTLVAELGWDTQGG 
DQTSPGGNDRMSMEAECESTTVSP 
LSCSIPTGCGQTREEVSARATPPPSL 
GASLLQTLTPDTHCTGVSA*KLATF 
FTFVGFLSSMNCLMLSKG*GTAKSF 
ATFFTFVGLLSSVYPLMSS 

20 

5517 

A 

20 

1 

665 


21 

5518 

A 

21 

401 

1739 

DNSHWRETLQM*RMWQSF*PFFNP 

C*T*ENSYW/MRNPTNVKNVAKLL 

AIPQPLLIIR*LILKRNPTNVKNVTKL 

LSDSQPLLNIK*YMLERNSTNVKNV 

AKLLIDLQILLY1SLFILERNLTSVKN 

VAKHLTGPQALLNIKDFILERNPSN 

VKfWAKHLYGLQP*LDIRGYTLER 

NPTNVKNVAKLLAILQPLLNIREFIL 

ERNPTNVKNVAKLLAVLQPLLNIRE 

FILERNPTNVKNVAKLLAIPQPLLIIR 

22 

5519 

A 

22 

618 

1655 

DIPERNASNVKNVSSHFASVYTKTQ 
HKCVYITEKSCKCKECEKTFHWSST 
LTNHKEIHTEDKPYKCEECGKAFKQ 
LSTLTTHKIICAKEK1YKCEECGKAF 
LWSSTLTRHKRIHTGEKPYKCEECG 




KAFSHSSTLAKHKRIHTGEKPYKCE 

ECGKAFSHSSALAKHKRIHTGEKPY 

KCKECGKAFSNSSTLANHKITHTEE 

KPYKCKECDKTFKRLSTLTKHKIIH 

AGEKLYKCEECGKAFNRSSNLTIHK 

FIHTGEKPYKCEECGKAFNWSSSLT 

KHKRFHTREKPFKCKECGKGFIWSS 

TLTRHKRIHTGEKPYKCEECGKAFR 

QSSTLTKHKIIHTGEKPYKFEECGK 

AFRQSLTLhfKHKIIHSREKPYKCKE 

CGKAFKQFSTLTTHKIIHAGKKLYK 

CEECGKAFNHSSSLSTHKIIHTGEKS 

YKCEECGKAFLWSSTLRRHKRIHTG 

EKPYKCEECGKAFSHSSALAKHKRI 

HTGEKPYKCKECGKAFSNSSTLAN 

HKITHTEEKPYKCKECDKTFKRLST 

LTKHKIIHAGEKLYKCEECGKAFNR 

SSNLTIHKFIHTGEKPYKCEECGKAF 

NWSSSLTKHKRIHTREKPFKCKECG 

KAFIWSSTLTRHKRIHTGEKPYKCE

ECGKAFSRSSTLTKHKTIHTGEKPY 

KCKECGKAFKHSSALAKHKIIHAGE 

KLYKCEECGKAFNQSSNLTTHKIIH 

TKEKPSKSEECDKAFIWSSTLTEHK 

RIHTREKPYKCEECGKAFSQPSHLT 

THKRMHTGEKPYKCEECGK/RF*Pl 

LNPYYT*DNSYWRETLQM*RMWQ 

SF*EIFNSY*T*DNSYWRETLQM*R 

MWQSJ*PILNPN*TYEDAHWRETIQ 

M*RMWESF*SILKAYYT*DNSYWR 

ETLQI 


23 


5520 


23 


3476 


MTLNEHAAFKHLFNKAHLAPPLIHL 

TLSGHSTCFREHRVGAKSNNPPASK 

GVWALQSARVKFAETTAGQKGMN 

TTWVFYYPNVASTWWGAMIPVHV 

VLPGGCHDASTLGDKEKRAGEAVL 

NVPGFQDSLESHGRIVNCLIPDVQE 

NNPSTGNESWLKSHQRLGEPTSRR

WLITLPVTSRSNSIGHLKGTPGKSKE 

EIKATVCAPTLKNGFWIAERVMTVS 

GHEGAASSRALREELRLLFSSCAQG 

RLTPHIAGYPSKAKLREERSGSNICC 

SAIFAVLQPLLLIPRGTGSGVDLLQT 

PTDLQLRVLTVRRKTNKQEGHPHQ 

NPTCTSPSSKTKDRSTRRNVKKDTQ

ELNSALRQVDLIDIYRTLHPKSREYT 

FFSAPHRTYSKIDHTVGSKALLSKR 

KRTEIITNCLSHHSAIKLELR1KKLTQ 

NRSTTWKLNNLLLNDYWVHNEMK 

AEIKIFFETNENKDTTYQNLWDTFK 

AVCRGKFJALNAHKRKQERSKIDTL 

TSQLKELEKQEQTHSKASRRQEITKI 

RAELKEIETQKNLQKINEFRSAV/PW 

QRHNKKK\KFWTNTPDEHQCKNPQ 

*NTGKPNPAAHQKGYPP*SSGLHPW 

DARLVQHTKINKRNPSYKQNQRQK 

PHDYLNRCRKGL*QNSTALHAKNS 

Q*IRY»WDVSQNNKSYL*QTHSQYH 




SECAETGSIPFENWHKTGMPSLTTPI 

QHSVGSSGQGNHAGERNKGYSIRK 

RGSQIVPVCR*HDCAFRKPYGLSPK

SP*ADKQLQQSLRIQNQCTKTTSILI 

HQ*QTNREPNHE*TSIHNCFKENKIL 

RNPTYKGCEGPLQGELQTTAQ*NK 

RGYKQMEEHSMLMGRRISYHENG 

HIAQGNLQ1QCHPHQATNDFLHRTG 

KNYFKVHMEPKKSPHHQGNPKPKA 

QSWRHHTT*LQTILQGYSNQNSMV 

LVPKQRYRSMEQNRALRNNATYLQ 

LSDL*QT*EKQAMGKGFPT**TVLG

KLASHM*KAETGSLPYTLYKN*FK 

MD*RLKR*T*NHKNPRRKPRHYHS 

GHRHGQGLHV*NTKSNGNKSQNG 

QMGSN*TKELLHSKRNYHQSEQAT 

YKMGENFRNLLI*QRANIQNLQRTQ 

TNLQEKNKQPYQKVGKGHEQTLLK 

RRHLCSQKTHEKMLIITGHQRNAN 

QNHNEIPSHTN*NGNH*KVRKQQG 

HG 


24 


5521 


B 


24 


8442 


MIPARFAGVLLALALILPGTLCAEG 

TRGRSSTARCSLFGSDFVNTFDGSM 

YSFAGYCSYLLAGGCQKRSFSIIGDF 

QNGKRVSLSVYLGEFFDIHLFVNGT 

VTQGDQRVSMPYASKGLYLETEAG 

YYKLSGEAYGFVARIDGSGNFQVL 

LSDRYFNKTCGLCGNFNIFAEDDFM 

TQEGTLTSDPYDFANSWALSSGEQ 

WCERASPPSSSCNISSGEMQKGLWE 

QCQLLKSTSVFARCHPLVDPEPFVA 

LCEKTLCECAGGLECACPALLEYAR 

TCAQEGMVLYGWTDHSACSPVCPA 

GMEYRQCVSPCARTCQSLHINEMC 

QERCVDGCSCPEGQLLDEGLCVEST 

ECPCVHSGKRYPPGTSLSRDCNTCI 

CRNSQWICSNEECPGECLVTGQSHF 

KSFDNRYFTFSGICQYLLARDCQDH 

SFSIVIETVQCADDRDAVCTRSVTV 

RLPGLHNSLVKLKHGAGVAMDGQ 

DVQLPLLKGDLRIQRTVTASVRLSY 

GEDLQMDWDGRGRLLVKLSPVYA 

GKTCGLCGNYNGNQGDDFLTPSGL 

AEPRVEDFGNAWKLHGDCQDLQK 

QHSDPCALNPRMTRFSEEACAVLTS 

PTFEACHRAVSPLPYLRNCRYDVCS 

CSDGRECLCGALASYAAACAGRGV 

RVAWREPGRCELNCPKGQVYLQCG 

TPCNLTCRSLSYPDEECNEACLEGC 

FCPPGLYMDERGDCVPKAQCPCYY 

DGEIFQPEDIFSDHHTMCYCEDGFM 

HCTMSGVPGSLLPDAVLSSPLSHRS 

KRSLSCRPPMVKLVCPADNLRAEG 

LECTKTCQNYDLECMSMGCVSGCL 

CPPGMVRHENRCVALERCPCFHQG 

KEYAPGETVKIGCNTCVCRDRKWN 

CTDHVCDATCSTIGMAHYLTFDGL 

KYLFPGECQYVLVQDYCGSNPGTF 









RILVGNKGCSHPSVKCKKRVTILVE 

GGEIELFDGEVNVKRPMKDETHFE 

VVESGRYIILLLGKALSVVWDRHLS 

ISVVLKQTYQEKVCGLCGNFDGIQN 

NDLTSSNLQVEEDPVDFGKSWEVSS 

QCADTRKVPLDSSPATCHNNIMKQ 

TMVDSSCRILTSDVFQDCNKLVDPE 

PYLDVCIYDTCSCESIGDCACFCDTI 

AAYAHVCAQHGKWTWRTATLCP 

QSCEERNLRENGYECEWRYNSCAP 

ACQVTCQHPEPLACPVQCVEGCHA 

HCPPGKILDELLQTCVDPEDCPVCE 

VAGRRFASGKKVTLNPSDPEHCQIC 

HCDVVNLTCEACQEPGGLVVPPTD 

APVSPTTLYVEDISEPPLHDFYCSRL 

LDLVFLLDGSSRLSEAEFEVLKAFV 

VDMMERLRISQKWVRVAVVEYHD 

GSHAYIGLKDRKRPSELRRIASQVK 

YAGSQVASTSEVLKYTLFQIFSKIDR 

PEASRIALLLMASQEPQRMSRNFVR 

YVQGLKKKKVIVIPVGIGPHANLKQ 

IRLIEKQAPENKAFVLSSVDELEQQR 

DEIVSYLCDLAPEAPPPTLPPDMAQ 

VTVGPGLLGVSTLGPKRNSMVLDV 

AFVLEGSDKIGEADFNRSKEFMEEV 

IQRMDVGQDSIHVTVLQYSYMVTV 

EYPFSEAQSKGDILQRVREIRYQGG 

NRTNTGLALRYLSDHSFLVSQGDRE 

QAPNLVYMVTGNPASDEIKRLPGDI 

QWPIGVGPNANVQELERJGWPNAP 

ILIQDFETLPREAPDLVLQRCCSGEG 

LQIPTLSPAPDCSQPLDVILLLDGSSS 

FPASYFDEMKSFAKAFISKANIGPRL 

TQVSVLQYGSITTIDVPWNVVPEKA 

HLLSLVDVMQREGGPSQIGDALGF 

AVRYLTSEMHGARPGASKAWILV 

TDVSVDSVDAAADAARSNRVTVFP 

IGIGDRYDAAQLRILAGPAGDSNVV 

RLQRIEDLPTMVTLGNSFLHKLCSG 

FVRICMDEDGNEKRPGDVWTLPDQ 

CHTVTCQPDGQTLLKSHRVNCDRG 

LRPSCPNSQSPVKVEETCGCRWTCP 

CVCTGSSTRHIVTFDGQNFKLTGSC 

SYVLFQNKEQDLEVILHNGACSPGA 

RQGCMKSIEVKHSALSVELHSDME 

VTVNGRLVSVPYVGGNMEVNVYG 

AIMHEVRFNHLGHIFTFTPQNNEFQ 

LQLSPKTFASKTYGLCGICDENGAN 

DFMLRDGTVTTDWKTLVQEWTVQ 

RPGQTCQPILEEQCLVPDSSHCQVL 

LLPLFAECHKVLAPATFYAICQQDS 

SHQEQVCEVIASYAHLCRTNGVCV 

DWRTPDFCAMSCPPSLVYNHCEHG 

CPRHCDGNVSSCGDHPSEGCFCPPD 

HQFLEAWVPDHQPCQICTCLSGRK 

VNCTTQPCPTAKAPTCGLCEVARLR 

QNADQCCPEYENGRLVSVPYVGGN 









MEVNVYGAIMHEVRFNHLGHIFTF 

TPQNNEFQLQLSPKTFASKTYGLCG 

ICDENGANDFMLRDGTVTTDWKTL 

VQEWTVQRPGQTCQPILEEQCLVPD 

SSHCQVLLLPLFAECHKVLAPATFY 

AICQQDSSHQEQVCEVIASYAHLCR 

TNGVCVDWRTPDFCAMSCPPSLVY 

NHCEHGCPRHCDGNVSSCGDHPSE 

GCFCPPDKVMLEGSCVPEEACTQCI 

GEDGVQHQFLEAWVPDHQPCQICT 

CLSGRKVNCTTQPCPTAKAPTCGLC 

EVARLRQNADQCCPEYENPCPLGY 

KEENNTGECCGRCLPTACTIQLRGG 

QIMTLKRDETLQDGCDTHFCKVNE 

RGEYFWEKRVTGCPPFDEHKCLAE 

GGKIMKIPGTCCDTCEEPESNDITAR 

LQYVKVGSCKSEVEVDIHYCQGKC 

ASKAMYSIDINDVQDQCSCCSPTRT 

EPMQVALHCTNGSVVYHQVLNAM 

ECKCSPRKSSK* 

25 

5522 

A 

25 

364 

477 

VIEHLVSQDGLDFLTS*SARLGLPKC 
WDYRREPPRPVH 

26 

5523 

A 

26 

6838 

7166 

GSRRPGCHCNSHTGRRSSRHRGHLP 
SPAASRGHPSPSAGPPRS*GARRPSL 
YAGYEAYLSGGGAGRPGHPWQLLP 
HASVSQGCCAGQAAGR*RSGCTQR 
RGQSSPGQSQ 

27 

5524 

A 

27 

817 

1299 

RKSHIFFFFFLRWSLALSPRLECSGA 
ILAHCKLLLP/GFKPFSQLSQPSSWD 
YRHPPPRPANFLYF/SVETGFHHVSQ 
G\GLNLLTS*SAHLSLPKCW\DYRRE 
PPRPAENLSSLTQYLECTQFE1HLGS 
QTALEGRLVPVTYPLGGVEISGHPV 
FLLTSSCGR 

28 

5525 

A 

28 

506 

761 

DGVLLLLPRLECNSAILAHRNLRLP/
GFKRFSCLTLLSPWDYRHLPPRLAIF 
FVFLVYVGFHHVGYAGLE\LLTSR* 
SARPRPPKIA 

29 

5526 

A 

29 

71 

425 

CRRKGVNMNAPLGGIWLWLPLLLT 

WLTPEVNSSWRYMIATGGSCRVMC 

YNELGLVSRRRILCQRYSPCILTLIY 

GEAKVLFVCGLSLLVHWPNNCAPSF 

RDNT*LLRFLHVIIVLLRPL 

30 

5527 

A 

30 

263 

463 


31 

5528 

A 

31 

287 

2919 

MASFPPRVNEKEIVRLXRTAGELLAP 

AAPFDKKCG\RENWTVALAPDGSY 

FAWSQGHRTVKLVP\WSQCLQ\NFL 

\LHGTKNVTNFKQFKDLPRQNS\DG 

GSEKIKPREHIIDCGDIVWSLAFGSS 

VPEKQSRCVNIEWHRFRFGQDQLLL 

ATGLNNGRIKIWDVYTGKLLLNLV

DHTGVVRDLTFAPDGSLILVSASRD 

KTLRVWDLRDDGNMMKVLRGHQ 

NWVYSCAFSPDSSMLCSVGASKAV 

VAAILV*LRLCWHHSHTGAQWC*L 

GRKSGISGYRAGGDLYHRMK*PCIR 

LQGVLYVHRCWSMSTFCFSFFLFFF 

FKVISPTVKYTDS*VN*FSSFMELGV 









*QVKPI*CKVFGFQMVSLCYFLEFF 

QIPEISYVFDSI*NLYLFSFRNNVLCL 

CRKKKNQKGLLYSKRRDCLRINLQ 

AHI*YNRLK*TLESCLELFCTVNY*S 

LESKIVYELILK*LNCFIFK*LMIVVS 

LGKIRWLNFDLLKCNCIIFIK*HFHF 

VMWFNILVVCQRNF1WL*IFYLLAV 

SVSLPRLKLVTQAYCKQVIISKGDA 

NGVTIC*PYVFCLYIF*KSGSFWKKK 

EKGVCST*PYLFPYILVN*FLE*MDF 

SIALWLNCIAFILCLGLFLN*HLTETF 

EIEFACLP*LT*RLILI*L*H*AYSLNY 

S*FIMLNIIL1KFSSFSIRCAILSSVCLN 

EAITFAFLLQVFLWNMDKYTMMRK 

LEGHHHDVVACDFSPDGALLATAS 

YDT*VYIWDPHNGDILMEFGHLFPP 

PTPIFAGGANDRWVRSVSFSHDGLH 

VASLADDKMWRFWRIDEDYPVQV 

APVSNGLCCAFSTDGSVLAAGTHD 

GSVYFWATPRQVPSLQHLCRMSIRR 

VMPTQEVQELPIPSKLLEFLSYRI 

32 

5529 

B 

32 

51 

285 

XGDEKGAAQVAAVLAQHRVALSV 
QLQEACFPPGPIRLQVTLEDAASAA 
SAASSAHVALQVFSELGFPPAVQR 
WVIGRCL* 

33 

5530 

A 

33 

38 

347 

FGVAPGVSFLHHPRPHPARATASTR 

RAWNPQPALPQPSGSSAVGSPSPRC 

HRGRTEW\QCPVMDTITIWNSLGPP 

VLVGEVGSTFPTAGCLGRLPGGSR 

WSLE 

34 

5531 

A 

34 

331 

1257 

FRGCHRGKDRMAARVTHHQPWAQ 

KHALASWPSPPEASTLKGPPPEADL 

PRSPGNLTEREELAGSLARA1AGGD 

EKGAAQVAAVLAQHRVALSFQLQE 

ACFPPGP\IRLQVTLEDAA\LPHPPAS 

SAHVALQVHPHCTVAAFPGSRFFSE 

LGFP\PAVQRWFIGRCLCVPERSLAS 

YGVR\RDGDHAFLYLLSA/RSRS/LQ 

PQDLALKNPQEDGRGTWTLVSPIIG 

GYPQGPTAQLPPACPSPLPAVSWSCP 

FRHLHSMPQKRPGCEMCSTQRPCT 

WDPLAAAST*QPPEVTRGEWPFPH 

KSDISRPPLNSGDLY 

35 

5532 

A 

35 

616 

1017 

LYWEKIIFSNLKTPETLFLVMTSNIF 

H1FWEGNKLPHYTTQFSGFYFILWY 

FR\DRASL\CRPVWGAVVWS*LTAA 

SNSWWRCSSCLGLPSSWSLSPMPPH 

SANFKFY*FHLIFVGDGGLAVLFRL 

VLNSWPQAI 

36 

5533 

A 

36 

3 

283 

FYTQNIFYSVESKLHTSTL*D\HYFFF 
FFETESYS1AQGGVQWGNLGSLQPP 
SPGFKQLSCLSLPSSWNYRCAPPCP 
ANFVFLVEMGFHWIKPG 

37 

5534 

A 

37 

260 

569 

RENLDLGEAFISRCLPLHSLAYFLH 

NLSFKSREMHNMVFKS*QALKFIRR 

IENNHLLFYYFYFYFERKSLXHSPLG 

NGVGLCLKKKKNNGSYKVLVWSF 

DSTE 



38 

5535 

A 

38 

468 

849 

TSEEFQQFTIHLTGVLHCHPDLETG 

GYKTF*WKSLEN*IAFFFFSETESPS 

AP\RLECSGSISAHCNLLPGSSDSPAP 

ASRIAGTTGTHHHARPIFILLVKEGF 

HHVGQPGLKLLTSGDPPAPASQSA 

39 

5536 

A 

39 

97 

448 

GSHEQPWEVVTGSRQPAR*SSR*AI 

MRKPRAAVGSGHRKQAASQEGRQ

KHAKNNSQAKPSACD/GDVAEVTA 

FRGSLLSWYDQEKRDLPWRRRAED 

EMDLDRRAYAVKWPTLQDLASASL 

EEVNQLWAGLGYYSRGRRLQEGA 

RK 

40 

5537 

A 

40 

990 

1812 

RLPLGRRSPSEAAGAETAPSSLSAA 

MTPLVSRLSRLWVRWTCVAIMRKPR 

AAVGSGHRKQAASQEGRQKHAKN 

NSQAKPSACDGRR*DGPGQAGICW

SVHLLRA/EATLPRGPWVWGLWAR 

*GQVNSVL/DANPFPPVWVSKVML 

QQTQVATVINYYTGWMPVTPGEEG 

KGHGSDPR*EPLLWGGCREGFLYH 

LHP*PCLFLPAWGYRSGPTLQDLGR 

AFLEEGDQL\WAGLGYYSRGRRMP 

EDTPARNGTAQRSLPQHIRPLNEWP 

LEWRLDACREP 

41

5538

A

41

360

652

IYLAGAOWLTSVILVLWKPRRVDH

LRSGVRDQPGQHGETSSLLKIQKLA 
RRHGACL*SQLLGRWRQENHSNPG 
DRGCSELR\CTPAWATEGDSVLKKK 

42 

5539 

A 

42 

1400 

1823 

NEKKSVFLRQSL/DSVAQAGVQWC 

DLGSLQTPPPRFTPFSCLSLPSSWDH 

RCPPPRPP/RFCFFLYF**RODFTMLA

RLVSNS*LQ/CDPPTLASKSAGITGM 

SYCTRPNQAGVQWWDLGSLQAPPP 

RFTPFSCLSLPSSWDYRH 



43

5540

A

43


131 

481 

KKKKELEKGNMD*IOSSRR\ETIKM 
RAKIF*TTNTKLMKKNKTRSLVSEN 
FNKIGKALARLRKKEKTPITKVRNE 
TEDITTNFIE 

44 

5541 

A 

44 

1374 

1835 

ILPCNKPPWNSMACTTKHLSRSQAY 
RSAGAFIHWTGEAGVGSALLSLAL 
QKPWANQGIFFPCGGRSQRGVSRN 
TRVWVOARNWY*VTPTHRVLWMR 


TAPRPALAASSAAS\PSAVGSPVAA\ 

PSQPGLMTQMATTATEVWGYAV 

GHTLSYSENI 

45 

5542 

A 

45 

1 

1470 


46 

5543 

A 

46 

62 

526 

EEKLKKGKSFQEYSGSLLLSIASVGF 

LSPTDIAIAVPRQWEEMRPLDIV*LA 

EPEEVEVLEPEEDFEQFLLPVINEMR 

EDIVSLTREHG\RAYLRNRSKL\WRL 

DNMLIXQIKTQVEASEESALNHPPNP 

GET\AEG\RAAKRCEKAEEKARELQ 

KAK 

47 

5544 

A 

47 

721 

1030 

MGPWEPRPQMRT*CLLPLKPNSPPP 
TPSEE/PGHLPK*PLEVI*WPSPSPGF 
P/PAFRGQ*ARGHPPPPPQWNTPFSP 
PQQ\PLSAGKT*PLTPFPALPYLGTG 









RK 

48 

5545 

A 

48 

66 

386 

PMEIFVDDEANLTLHGVQQYYLKL 
K.DN E.K.N RivLr DLLD VLEr NQG VIF V 
KSVQRCIALAQLLVEQNFPAIAIHR 
GMPQEE/QFKDFQRRILVATNLFGR 
GMDIERVNI

49 

5546 

A 

49 

434 

858 

CLSHTMDPYSPNLRPPTPPHNRWVI 

FVKSVQRCIALAQLTSGSRNFPAIXAI 

HRGDGPREGGGFFRVFRQF*RFFNG 

RIFWGYQPILGRGMGHSRRVNIAFN

YGHAWRVFDTYLAFGVGQRQGRF 

WATKGFGLFTFCVPMED 

50 

5547 

A 

50 

1 

660 

LALARNKSLNLKHIKHFILDECDKM 

LEQLDMRRDVQEIFRMTPHEKQVM 

MFSATLSKEIRPVCRKFMQDPMEIF 

VDDETKLTLHGLQQVYYVKLKDNE 

KNRKLFDLLDVLEFNQWIFLKSVQ 

RCIALAQLLV\EQNFPAIAIHRG\MP

Q\EERLSSVFSSFKDFQRRILVATNL

FGRGMDI\ERVNI\AFNYDMP\EDF\D

TYLHRVARAGRFGTKGLAITFVS 

51 

5548 

A 

51 

143 

387 

QPCLTRY*DTRCTNQ*ETTS*RLCKE 
PFRPGSFRPNWHLANWEN1ERLQL 
VS\TLRLIEEDSSLN*YSIIIFHSESYR 
YN 

52 

5549 

A 

52 

2 

1360 

VCVCVCVCVCVRQSLAPLPRLEGS 

VSILTHCNLRLLGLSDSPASASRGA 

GTTGMCHHTWLMFLFLVETGFRHV 

GQAGLELQTS\DPPALPFPKCWDYR\ 

VNHHARP*HSFYSIRLGDQNVMAS 

GQRPASMPCPVFLVQMSPAAVSTS 

VREWAPDSQRGHRDGHAKLWGVA 

DSPAPACPCTFGVTHETGWGSHLPS 

PKRQS/CYKGSQRPTQPQVIKQAPSS 

MATIIPIHQGDVEGGASWFTPPSAET 

DPRSGPRTLCREGKCR*LSPYSSIKP 

GLKMG*IRDFHSTKEKF*WGQNIDL 

LIFESLLTRRERANDFVVEGPTQL*L 

V*SIMNANLNSRKAELPNNGTSTA 

MGSASSFSVCLFYERETPRKAAAH* 

ENVWELTRRPFIFFEMEFCSwAQA 

GAQWCHLGSLQPAHHEFK*FSSA/S 

LPSGCDYRHPPPCPANFFYF*\RDGV 

PbRCPCjWPR 

53 

5550 

A 

53 

218 

380 

RKMKNSYPAPFAPRPIYSSPPPPQE/P 
Q*GGRDMAAIW*GALSIPPPVPDLL 
PLG

54

5551

A 

54 

76 

376 

YKIIFVLETCMYKVICRFANNTMHL 
SYTVIHKDPGKGRGIISPNLFYFIYFE 
MEF/SLLMPRLECNGTVAILAHRNLH 
LPGSSNSPASAS*VAEITGMCTMP 

55 

5552 

A 

55 

97 

437 

WTRTHRASTCHVAYQEDGLLHLRN 

TNDPENFPKSYHYHRIIIGGASG*QA 

TAPi7AT!i-rvnr;n\/inT ni?\/TDTD\ t 
J /\ivc./\ J rl i JL'OjU V lULiUr v lr I r ILvJ 

TTWGLEGTCENGDSLPADLMHQSP 
LVGQPTEDFRNTGGH 

56 

5553 

A 

56 

22 

424 

ALGMAH1TLFFFFLLLFCDSLALSPR 









LQCSGTISAHCNLVPPGFKQFSCLSL 

LGSWDYRCMPPC/RWLTFVFLVET 

GFHHVGQAGLELLTSGDPPALA/FP 

KC*DYRR\DPRAWALFVFLT*FFSKL 

KYHKAKEKWS 

57 

5554 

A 

57 

514 

835 

QFIFNVNKINSKTIIKDRWGPGTVGF 

TPVIPQHFGRPQQANHLRSGVRDNH 

PGQQGETPSLLKIQK*AGHGRGHL* 

TQLLRRLRQENHLNLGGGGCSEPRS 

RHCTPAW 

58 

5555 

A 

58 

234 

457 

SKTENIKYWLVHGELETALHRWRN 
SKMAY*LGK\QFLINLRTQLPYDSAI 
PFIGCIPFKYECWTYNKDLFTHVYI 

59 

5556 

A 

59 

1 

336 


60 

5557 

A 

60 

192 

432 

FDFNLNSPTWAGHGGSCL*SQHFGR
LRRVDHLRSG1*DQPG*HSKTPFLL/ 
KNTKISWAWWRTSEIPAAREAEAG 
ELLELG 

61 

5558 

A 

61 

81 

439 

CEHHKAHPPPVSPYQSMAPSFTQRL 

RPKEQVSPTMPFSLVSTPIHLTSGTP 

AGLPASIPGPLQSPWPSTTTGT\PDKI 

QGPSPARPAQNSPVASS*ATSSPWP 

ARPPWTPLHSSLPALAA 

62 

5559 

A 

62 

297 

561 

SQHFGRPRQVDHLQSGVQDH\PVQ 
RGETPSLLKIQKLARGGGARL*SQL
LRRLRQENHLNPGGGGCN\EPI*HR 
CSPAWAIE*DSVSKK 

63 

5560 

A 

63 

3 

808 

FFFWEPEKAFIEEFEGVSSSSSPSQL 

GQQRKQDAGVLHSWNSALKNLNV 

PPPPPGGWCLWGTAALSSSQAGRG 

SGIGRGGGESGGTG/ASSAEGEAPG 

GIVSCA*GPGCRSSGAKGLGRLRAS 

SLQAPAAALIQAAPGVR*TGLGPYL 

SAVHAGPAAAAAALPGCLSVSPASP 

AAPVGATPRANGPLNSENHRCPPGP 

PGPQFGLGPLGPGPGSGPWAXAHSQ 

NMRAAESAAAAWLSVPSQSPRLSP 

SSSSSSSSPTAWNFSSPRDMAGLR 

64 

5561 

A 

64 

1005 

1150 

AWAWVCVSSGLGAPCGDGCCRGR 
GVASKCC\CAGGGCVSVG*GNVCA 
RA 

65 

5562 

A 

65 

3 

230 

LVEMGFHQPGQHGETPSLQKI*NKK 
\LAGHGGTCL*S*LLRRLSQEDGLSL
GGRGFSEPGLCHCTPA*TTEQGLKK 

66 


5563 

A 

66 

317 

503 

KKPKPPKPPWEPTTFG/TPAFIPPRGI 
WFLIAPCGWV*EEGGPSGGPWPWC 
PLGKTHGEGGKP 

67 

5564 

A 

67 

523 

741 

ERGFFFGPHPGGRGKKLG*WGPPFP 
GLKEFSPLRPP*EGGLRGPPPLPG/SF 
LGFLRKGGFKHGGQGGQNPGGG 

68 

5565 

A 

68 

498 

778 

VTINMMTGIVPYISILMLNVNGLSA/ 
PLERRRLAEWIKIHKPNICCLQEIHL 
THKDSYRLNVKEWKKIFHTNGNSK 
*AGVAIVMSEKTDFKATTV

69 

5566 

A 

69 

187 

488 

KRFGKNGFYPCGPGGLKPRALKEPP 

PLTPQRGGITSSSPPPQPKKTLFFGY 

WPKKSL*INPQGGLNPSQGGKP\WG 









GFPFLDNYGGCWNRPPGGPWGGYL 
K 

70 

5567 

A 

71 

1006 

1979 

GLGASILDSTTSTWSWNASRLLIGL 

KNSLFFFEME/FSLLLPRLECSGTISA

QV\NLRLPCSSDSSASASRVAGITGM 

CHHAQLIFVFLVEKGFHHVGQAGL 

ELTASGD/PTCLGLPKCWDYR/R*AT 

APGLFFFFLRQSFTLVAQAGVQWR 

DLGSLQPPPPPRFKQFSCLSLPSTW\S 

WVYRHAPPCPANFVFFFFFFFFLVE 

KGF\SMLLRLVLNS*PHDPDPPASAS 

QSAGITGVSHHTRPMSFKNIYNFFFF 

r r fc I tbKSx V AQAG VQWRDLSSRQP 

PPPGFKRFSCLSLSSSWDYRRVP/PM 

PG*FCIFRRDGVSPRWSGWSQTPDL 

K 

71 

5568 

C 

72 

126 

472 

MADCCAKQEPERNECFLQHKDDNP 

NLPrU-VRPEVDVMCTAFHDNEETF 

LKKYLYE1ARRHPYFYAPELLFFAK 

SSMNFGMKGRLRLPNRDSSVPVSK 

NLEKELSKHVARLSQRFP* 

72 

5569 

A 

73 

3 

873 

HELLSTPLAFGTMKGVTLMSLLFLFS 

SAYSRGVFRRDAHKSEVAHRFKDL 

GEENFKALVLIAFAQYLQQCPFEDH 

VKLVNEVTEFAKTCVADESAENCD 

KSLHTLFGDKLCTVATLRET\YGEIA 

\DCC\ENKEPERNES/CFCNHKKDNP 

N/LPPIG*GPEVGCGCGTGFFMDNG 

RRTFLEKILIMEIGQEGHPYFLWPRE 

LLr LLLKRVLKLLFTGMLAKL AGL 

KLACLLAKARWDFRNEGKASSAKQ 

RLQCASLQKFGERAFKAWAVTRLS 

QRFPKAEFAEV\SKLVTDLTK 

73 

5570 

A 

74 

849 

1277 

YNTTKLVPLYLCKMIFLLFCYVYVL 
RQCLA/SVAQAGMQWHNHSSLKS* 

OD\^T T/ifcPCiTi ni noon Arvx/n rT\*"\r\T* 

PPXGLK* SSHLSLPSS W\DYRC VPQRF 
SLLFIFCRRKGFFPILA*AGLEQLGSR 
NHLALASHLSVGIIGVSYHTQPVLT 
AAIAM VL Yr VNK.LSVLL 

74 

5571 

B 

75 

120 

323 

ITRRYAEFSSALVSINQTIPNERTMQ 
LLGQLQVEVENFVLRVAAEFSSRKE 
QLVFLINNYDMMLGVLM* 

75 

5572 

A 

76 

154 

432 

QLPEAGGPGLQEPLQLGELDITSDEF 

ILDEVDVHIQANLEDELVKEALKTG 

VDL*LHSGERTRRD*QLPEAGGPGL 

ybPLUjLGbLDn SDEr ILDEVDVHIQ 

ANLEDELVKEALKTGVDLRHYSKQ 

VELELQQIEQKSIRDYIQESENIASL 

HSQITAC 

76 

5573 

A 

77 

2 

630 

FFFVSGPAAHDLFHAVMGRTLSMT 

LKHLDSYLADCYDAIAVFLCIHIVL 

RFRNIAAKRDVPALD/RVTEFWSLM 

PNRPRTLLVLHDSALTLDSY*PGIIN 

LYSHSFAPEAWLLFDSPFSNHCPPT 
PTP?Y*PT N*M'MPWC'T PCipcrMippvirf 

TSDSD*AHRYWEQVLALLWPRFELI

LEMNVQSVRSTDPQRLGGLDTRPH 

YVREGKGNKG 



77 

5574 

A 

78 

175 

2385 

QLPEVRLRGMAAAATMAAAAREL 

VLRAGTSDMEEEEGPAGG\GPGLQE 

PL\QLGELDITSVDEFILDE\VDVHI\Q 

ANLEDELVKEALKNTGVDLRHYSK 

QVELELQQIEQKSIRDYIQESENIAS 

LHNQITACDAVLERMEQMLGAFQS 

*PSGSIKL/CRFRTLQEQSGAMNIRL

RNRQAVRGKLGELVDGLWPSALV 

TAILEAPVTEPRFLEQLQELDAKAA 

AVREQEARGTAACADVRGVLDRLR 

VKAVTKIREFILQKIYSFRKPMTNY 

QIPQTALLKYRFFYQFLLGNERATA

KEIRDEYVETLSKIYLSYYRSYLGRL 

MKVQYEEVAEKDDLMGVEDTAKK 

GFFSKPSLRSKNTIFTLGTRGSVISPT 

ELEAPILVPHTAQRGEQRYPFEALF 

RSQHYALLDNSCREYLFICEFFVVS 

GPAAHDLFHAVMGRTLSMTLKHLD 

SYLADCYDAIAVFLCIHIVLRFRNIA 

AKRDVPALDRYWEQVLALLWS\RF 

ELILEMNVQSVRSTDPQRLGGLDTR 

PHYITRRYAEFSSALVSINQTIPNER 

TM\QLLGQLQV\EV\ENFVLRVGSW 

SFSFKGREAACVFWIQQLWTWMLG

VLM\E*ERAADDSKEVESFQQLLNA 

RTQEFIEELLSPPFGVLRWHL*KEAE 

AT IPRfSOAFRT RfiPFARVTnT TRfiF 

GSSW\KSSVESLSQDVMRSFTNFIN\ 

GT\SIIQGALTQLIQL\YHRFHRV\LSQ 

PQLRALPARA*AHSTFHHLM 

78

5575

A

79


1 JO 1 

PI FIOT PCt} PR Ml Tr»RMY*TMH5*5K' 

CQD/HSVCSWVKAFWRAVVAHAC 

NPSTLGG*GMRITRSGVRD*TDQHG 

ETH 

J-> Jl 1 1 

79 

5576 

A 

80 

132 

356 

KDKIHIIISIILKKFDKI*YSLIIK\TL*K 
LGME*TYLNIIKVIYDRPTASIILSGE
KLKSFPLKSGR*QECPLL

80 

5577 

A 

81

108 

335 

J J J 

1^ IVJL-TVII llllOll JL*1VXV1 JL/XV1 I OJL*ll£V\ 1 JL* 

KLGME*TYLNIIKVIYDRPTASIILSG 
EKLKSFPLTSAR*QECPLL

81 

5578 

A 

82 

3 

6742 


82 

5579 

A 

83 

499 

1018 

PTRVFSITAKLINGGVAGLVGVTCV 

FPIDLAKHSPQQPALGKPCYKGMIR 

LPDRRRLGRRASSAMYRGAAVNLT 

LGTPEKAIKLAANDFFRRLLMEDG 

MQRNLKMEMLAGCGAGMCQVW 

TCPME*PTRVFSITAKLINGGVAGL 

VGVTCVFPIDLAK\TRRSNQHWESH 

VTKE*SDCLIEDGSGGG/PSSAMYR 

GAAVNLTLGTPEKAIKLAANDFFRR

! 1 MPHfiMORNT KMFMI AfiPnAfJ 

uw1V1D1^VJ1V1\^I\J^1^iV1V1C1V11^/\VJV^nJ/\vj 

MCQWVTCPMEMLKIQLQACWTP 

GRPSSGLGLSTLHLQVLHNWFGFHP 

QAPLCHPHCLG 

83 

5580 

A 

84 

3 

305 

GTRQGCPLSPL*FNTVLEILVRHS/RS
SSSSSSSCLTADP/MVLHIENPKGSIK
*VLELINEFSQVAGYKINM/QKTVAF
LYTNN*LSKKEIKKTIQFIIASKRT



84 

5581 

A 
r\ 

85 

OJ 


781 

ss\si v orr oM V V HA* Aoi V cJvJfcJs. 
IGNQGTKL**FEEMI/LYIGNSRASA 
DTLLEIKDFSKISGYRFNIHQSVMFL 
YFSC 

85 

5582 

A 

86

456 

71? 

iNir 1 i ir V IIjoL 1 V V v^Avj Vv^ WLU 

LGSLQPPLPG\SSDPHASTS\*VAGIT 
GVSHGAWLIFFSFFPFLRRSLPLSLQ 
FGQQSETLF 

86 

5583

A

87



IN IN r r o&K V LLLbrKLcCN \oRl* AHCN 
LR/LPGFKRFFCLSLPSSWDYRLLPP 
RPANFLYF/SV*TGFHHVVQASLELL 
TSDDPPAL 

87 

5584 

A 

88 

372 

666 

NVCFIRTGTDCIISEHNGMKLAITKE 

V T CUT C\f\/\T\\nf fj xtd t?l xTVi/^r^\/vt?r? 
lxi_iCrHA£r i V WiSJijNKrLNNv2u VKcE 

ITREIRKYFEMNENKNTKYQN*ECV 

MTTVCRGKFIAANVHIKKQDSNYV 

R 

88 

5585 

A 

89 

36 

350 

KLQLHNLKARIAAIHQAQ*LTPVIPT 

1 \X/T7 A 1/" A m> VI ODD IT\/V ACT /"*/"\/ri** 

U W cAJtvAUKr LfcrKJb VKAoLOQ/r * * 
GTHVHKTYKIARAWVAKHLWVPS 
YFKRLEVRRVALSPRGV/NGCS*RLI 
LPLPSQP 

89 

5586 

A 

90 

58 

375 

VFYNKTTFKVFIIAIICSLIYFVCLHSI 
VI*FFIL/CYCRVSEIFGYRCFIIKLLL 
ls.i>Lly*JL.*r Vri/MLr AClLLLr/WLNL 

yflrlstivff'kkllivltffflyrs 
iifs\cfylllfsff\cffgctl;csclc 
lqlclffsfsyflihvlr 


5587

A

91



UJVI1L Yil;iNJrKL>o6K.NrLGLINKYSK 
VAGYKINTQKSAAFL*TNNYLKN*P 
/MRTIPFTIAASSSYLETYLTMEVKD 
LYTENYKM 

91 

5588 

A 

92 

31 

358 

NVKSGQNLTMGEGSVSQGSIFSSLG 
vjnK.1 Vav V 1MVRCRC/PAHRGLSR 
WLPST\SSGTQ*GP*NC*PNPPITLLR 
PPRPRQRCPSLCQFPP*TSPRQRPSQ 
PPQGPPEFP 

92 

5589 

A 

93 

1 

1253 

MRIPSFLLNLQDFEDKMEIKRYRPE 

GPLATSAQSHVSTAPLISTQIPPHVP 

PLFLDCRHLTPASLFDQTLIPKKAPS 

NCVTDSYRKTSEIHPGSLFLILNLQF 

RTSTSNCCFSGSGKEALTGSIGRERS 

PLLAQTPFPTLKKSQRSATLECDEE 

ASLWENPLRDHGLFPASEHRLPLPL 

NQQKGPPLRTSPAAHSPPNFAG\MP 

PVASSEGLTSIYSQLSPIG\PPGRRRQ 

RGCPY*VQLHGDWPLCTAVYT*AR 

RSVAL*SRFCG*QTRR*TRWQRNPP 

VCSG/HKLREFPLKLELFPQIQDPIG

HQFVISVGQVRGH*STQKLYGPIRS 

AorUADvuuAKuivXurULuorrrAr 

NLHPGARALPGSCWSHLPGVRSQE 

VSFLDSGSGSRVNPPTAEDEAWESG 

LCSSHPACQEHTKDL 

93 

5590 

A 

94 

216 

1374 

RPQGMPVSSPPPPKLLLDPLAQLFS 
GQQDPQPLEKPHLQCLGRELGSGR 
RGGGWSPGVENRSQTLFFPGHRAP 









APGDAEGSGP*SFSGDARPTGHLLP 

PRKG\HL*SSGELRPGRCNSLQAFG/ 

HRGLGVPRGALGVSLAGSSFPSPPR 

RRPSG*AANNSVASEGAR*ALGRG/ 

GPEAMP/DEFVRGVSGG*GPRGRSR 

LLIEW*TEAMFADPTRTAGGVGTD 

GKLLPVPGQDH*ARPWKPREIRAVS 

RAGHVG/LPAC*EIPAQSLSGP\RGA 

SAAIFGSCVPHGGSTRAGMVVRVA 

RGSPRfiFOOLVLTRXfiTTRTPfrMNrc 

SSPLAFSPLKSPG/MGGOLAGP/PGA 
PSARWSHGGSAGRWCGRGWGQE 
LVSWIHRSVTGSPHKFVGCWRQTS 

94 

5591 

A 

95 

282 

612 


95 

5592 

A 

96 

19 

1153 

DLTPGKWDQQEPGRARAPGWRLG 

AGGGEPQSNPLFPRAPRTCPRGCRT 

NWPIKLLCNGKKPEAPGGARGASL 

SEPSPLPGWPWSTGSEEADLEDRTEN 

ERPKGFDSHDVEMLKP*NPKVPNCE 

GERGCSRAGSTLEPSPGESSAQVQE 

KKDYAQQWF/SNRGQLRPHMPLPT 

PLGH*AAAGGSGRENVIPLGMCLVS 

GGDRCC*TPCNPRWEGPSPTPK*PF 

RQRWRNSRVRSIAHGILADGIHGFG 

DQLDLGSEEKAPASEGTLEVLPRAN 

GGVALPVA*RWEDGRRHRLQGKV 

nnm cawpiI Pfiif qfi q<ipprppupq 

DSL*C*GCRGLGPL*CRGCPRLTSG 
ASPLPPPPGNLVGGSGPGDPRPSCQ 
LLPPGKGHL 

96 

5593 

A 

97 

429 

945 

KSVLSTLNWAQPRHWPETLPWVPS 
*PETSLPPPGGS/APPTPDMD*LNSAS 
PNSAPPAC*NPSPACRLSSLPAITPVS 

r*nPT\PCiTRr»APVPAFTP\ll/I PPA A<5\P 

FKAQTASKG*PSHMWLPPLPLLTFP 
KPV\PSPALLP*APSQPPKGVPQAPS 
QHPLTPSHRTCSPAGLLTP 

97 

5594 

A 

98 

178 

603 

SQHFGPRWTNHLRS*IQDHPG\QHG 
KTPSLLKILKKKLAGHGG AHL* SQL 

AAWMTE*DSVSKKKRPGTVAHAC 

YPSTLGGQGGRITRSRDRDHPCQYG 

ETPSLLKMQKLAGHGGTRL 

98 

5595 

A 

99 

405 

V07 

fiSFLFF(* l FFF*DRVPPr c ;P\ffW , ?AVV 
uoritrrv/rrr i/iwrrvorturv on v v 

QQPQLTSALTSWGSHLSLLSSWEHR 

DV\PPCPG*FFIFCRDGV/LTVLHRLV 

SNFWAQSILPPWPPKVLGL 

99 

5596 

A 

100 

3 

307 

FFFLEPSLACRQAECNAHLAH/CKL 

TsKWFTPF'SnT ^1 RMQWTMVRr'PPQRT 

GNFFVFLVETGFHCVSHDGLDLLTS 
*SVRLSLPKCWDYKGESLHRAQNY

100 

5597 

A 

101 


469 

PKMAOTOTCGYI HI TT Al MPFYFRN 
TQAKKNLKRDC*RPSRMPKDLACC 
KSIQNKIKQKIGRKK 

101 

5598

A 

102 

265 

446 


102 

5599 

A 

103 

283 

398 

NWQEKCTFQIIGGRKRMSFRIILINF









FHN*DRTVCYVP

103 

5600 

A 

104 

283 

397 

NWQEKCSFQIIGGRKRMSFRIILINF
FHN*DRTVCYVP 

104 

5601 

A 

105 

2 

1012 

AEALVESFWKAKQHTKEELKSLQA 

KDEEKNENEKAKAACSAAAMEEDS 

EASSSSTGDSSQGDNNLQKLGPDDV 

SVDTDSIRRVYTRLLSNEKIEIAFLN

ALVYLSPNVECDLMYHKVYSQDPN 

YLNLFIIVMENRNLHSPEYLEMALP 

LFCKAMSKLPLAAQGKLIRLWSKY 

NADQIRRMMETVQQLITYKVISNEF 

NSQNLVNDDDAIVAASKCLKMIYY 

ANAYAVTKNLGLYYDNRIRMYSER 

RITVLYSLVQGQQLNPYLRLIVRCD

HIIDDALVRLEMITMENPADLKQFY/

RGI*RRTRWVAAFWDRASEPKANSI 

GFGGSQLWMPTPVASYT 

105 

5602 

A 

106 

966 

3172


106 

5603 

B 

107 

1 

2271 

MAGKASESWRKVKDTSCMAVTRE 

NEKDAKAETPDKTIRSRETYYHKNS 

MWETAPMIQIISQGVTPTTHENYGS 

TIQDEIWCLTNFCLDDMLSFVLESC 

TNHCAYCLNVWYRKRAAAKHLIER 

YYHQLTEGCGNEACTNEFCASCPTF 

LRMDNNAAAIKALELYKINAKLCD 

PHPSKKGASSAYLENSKGAPNNSCS 

EIKMNKKGARIDFKDVTYLTEEKV 

YEILELCREREDYSPLIRVIGRVFSSA 

EALVQSFRKVKQHTKEELKSLQAK 

DEDKDEDEKEKAACSAAAMEEDSE 

ASSSRIGDSSQGDNNLQKLGPDDVS 

VDIDAIRRVYTRLLSNEKIETAFLNA 

LVYLSPNVECDLTYHNVYSRDPNY 

LNLFIIVMENRNLHSPEYLEMALPLF 

CKAMSKLPLAAQGKLIRLWSKYNA 

DQIRRMMETFQQLITYKVISNEFNS 

RNLVNDDDAIVAASKCLKMVYYA 

NVVGGEVDTNHNEEDDEEPIPESSE 

LTLQELLGEERRNKKGPRVDPLETE 

LGVKTLDCRKPLIPFEEFINEPLNEA 

LEMDKDYTFFIVETENKFSFMTCAF 

ILNAVTKNLGLYYDNRIRMYSERRI

TVLYSLVQGQQLNPYLRLKVRRDH 

IIDDALVRLEMIAMENPADLKKQLY 

VEFEGEQGVDEGGVSKEFFQLWE 

EIFNPDIGMFTYDESTKLFWFNPSSF 

ETEGQFTLIGIVLGLAIYNNCILDVH

FPMGCLQEANGEKRNFSVTWETLT 

QFLYQSLKDLIGV* 

107 

5604 

A 

108 

264 

378 


108

5605 

A 

109 

297 

353 


109 

5606 

A 

110 

1034 

1195 

MQKKMIFQQTTAPLNPVQTV*RHP 
TPKRKECPSLRRQSTLLRMMWYLP 
CDQWS 

110

5607

A

111

1075 

1826 

LGLQNRNFGYKKHFWVLTDSEPAG 
VGGGEEWFFFSLGSRTDRSGAISPLI 
TLRTLAAKGAHQALTKTMEMMSD 
KKRI*VTFLFEFKMGRKAVETTCNI 









NNAFGPGTANERTVRWWFTVKLCK 

GDEGLEDEEP* WPDPLEVDSDLIAR 

TIIKAD\PLTT\TREVAEEFNIDH SMV 

FP\HLKQIGKVKKLNK*VPHEPSKN 

KLNFLEASSLILCNNNEPFLSRIVTW 

DENWILYDNH*QPAQLLD*EAPKPN 

LHQK 

111 

5608 

A 

112 

540 

724 

EAMFYTWEGEWAQEIFVGLKKIRL 

GNI\AHAYNPSTLGG*GGQIA*AQEF 

DTSLDNIARPVS 

112 

5609 

A 

113 

1 

370 

QRSRGRG SLRIGQTCLRRDMLSQEL 
PRLEFPLLLLLMLLMP\PPPCPAHRA 
TLFDPTWESLDARQL\PAWFDQAKI 
GILIHWGVLTGPSYCIERV*RNWQM
EKIPKNVEFMTDDYPPRYTHEDF 

113 

5610 

A 

114 

151 

379 

PFYVENP**YTLKNFLLELISNYNKV 
AKYKINIQRSIYFLYASHKQVDFKV 
QTQ/LPFTLA/SL/RMK* FSISLTK* VQ 
D 

114 

5611 

A 

115 

17 

214 

KQRLSYCIYKTTKTYATYKEIHR/LE 
VNGCKRIYHANTNQKKAGVAILISD 
KKHLRQEYYQG*KEML 

115 

5612 

A 

116 

249 

675 

QYISVTRCHISMLTLNLNGSNAPLK 

RYSLTE*IFLNDTTV/CIPRHTDRLKV 

KG*RKTCYTNRKQKQ*/GIAILMPD 

KTDVMSSSSSSSRK*IIVKGSILQED 

MTIQNIYTPNTIAPI/R*VKLILLGLK 

G*IHSNTIMVGKFSIR 

116 

5613 

A 

117 

67 

373 

FCDCHHFILMFKSPHIWPVGIFSSWL 
LCFFWACLHHSLSIALLSCTKRYSG 
LILYFLCSSFEITVSSKSSVSF*RRMV 
FRNQVLGSRCACCC*GVAAPRPFP 

117 

5614 

A 

118 

366 

795 

AWVEQSKVLIKEGGIQLLLTIVDTP

GFGDAVDNSNCWQPVIKYFDSKSQ 

D\YLNAESQVNRCQMPGNRV\HCCL 

YFI APSGHGPLHN*RLPPSGRIG * YM 

FVTTWHCLLLRLKPLDIEFTKHLHE 

KVNIIPLIAKADTLMPEEC 

118 

5615 

A 

119 

105 

702 

AGSSVSLGFCPAAAAHKPRGGALR 

LPVFRRRAQQGPDYALAGVARQPA 

GTCRRRCNRSHCRAEDPQWPTPAA 

APAAHSPHMSLGESGLGKLILINSLF 

LTDLYSPEYPGPSQRIKKPVQVYILV 

FLIDDKLE*Y*YTQSTCCNFHYAS\Q 

SWQPAINYIDSKFEDYLNAESRVNR 

CQMPGNRVQGCLYFIAPSGHGPLH 

N 

119 

5616 

B

120 

7 

177 

MSVSARSAAAEERSVNSSTMVAQQ 
KNLEGYVGFANLPNQVYRKSVKRG 
FEFTLMVVE* 

120 

5617 

A 

121 

2114 

2945 

KSVAFLCTNNVQVQAENHIRNVVIS 

VTIVAPIHKIKYQRMYPAKEVKELYR 

ENYKTLMKEIIDDTKKWKNIP/C*W 

VGRK/LIYRYNTIPIKLSTSFFTELEK 

KILKFIWNQK/HSRIAKAIL/AQKYK 

AGGITLPDFKLYYKTTVTKTAW/YY 

WYKNRHRDQWNRTENPEIKPYTCN 









QLIFDKINKNKQ*GKDILFNK*CWK ~ 
NWLPICRRMTLGPYLSPYTKITSRW 
ITDLNVRLQTLNILEENLGKNLMDIS 
/V GKEFMTRP/PKAYATKTKIDK*DL 
IKLKSFC 

121 

5618 

A 

122 

3 

113 

GLDLLAS*SARLGLPKCWDYRSDW 
GPGPVCGHLCRG 

122 

5619 

A 

123 

145 

540 

FFVFFVEMGFHCVAKAQAYNIFFFF 

LRWSLALSPRRECCGAISVHCKLRL 

PGSRHSPCLSLPSSWDYRRLPPRPA 

NFFFVFLVETGFHPC*PGMGLDLLT 

S/SIRPASA/FPKCW\DYRA*AIAPGK 

MRLFNSL 

123 

5620 

A 

124 

739 

835 

LAKISNSDVLKLSMLHKSENSISHK 

TGAERNKYLLIKLKVI*LLTL*VNIC 

FFQLQFYVK*SFQIYVAWKVLIRQS

Y*FLPVIFSIIYFFYL*LIFFV/CDTFCF 

*SHFLLFIFYVYFI\LVTMRITYNILEL 

*HFNLNLFQLKFNHIPKFYCYIYIAL

L/CFMLLM*QIISLFIVYH\VTDLLITF 

YAFAF*IM*KIKSRVTNQNYNRTVF 

MFVYWLPLPESFVYSYSLLIYLHSY 

CLEFIYFNLKDLTLPECQ\FRDKWIF 

FQF*KKIRKCLNFS/CHF*RISFPAIYF 

SIDRFLHYFKYIIHCLLAFKVSAREIS 

C 

124 

5621 

A 

125 

48 

492 

HPTGPGRRSHPRPCPRRSLTLSAPSP 

WPPG SLQRSLLDPQRSPWRPRTQAC 

TRSAHALRHTIPRSTLGVTVGLEAA 

PPPQHLRAKGT/PPVPGAQPPPGPRP 

WPTQLRERPSPEPPPPGLGLPGSKTP 

ALPARPRVG*MGPKAQPHTPF 

125 

5622 

A 

126 

536 

669 

YLNVGNWVGNPMAHTSNPSTSGG* 
GGWST*GQELKTILTSLVKHS 

126 

5623 

A 

127 

793 

829 

GRCHLAHGGVQGSRIKPQQLGAWG 

RRQRDIGNRGSRGLWGEKEEKAGE 

RKDEPALARSTSQAPSRLHPCIFNPL 

GVRYPRWALHPQLCAPP*AHVSVS 

TQIPRQRPQVAVTLSVPPISG*FRAP

QGKLPNGQMLYGRHPHPLQAPPTA 

RASPSHVLTLLGTEQPPRA*THSPEK 

W*GVPAWLRTSPRPRPVGRREQVT 

LIWKPKQN*SAES\PPSHRAYPEIPFR 

LLCLQPRTGPVLLLGP*SSKCPEPPC\ 

TKSKPGWGKACSPLTGPCLPSP/PDL 

PSVPSPPSPVLPDPNRTATASRNPTV 

TERYLNASLCWSQPDLPQGPIITDM 

PSAPAVPLTSDNCPSMSPAPSGKAV 

RQMPPGTWWGSG 

127 

5624 

A 

128 

322 

386 

IRCFALRFSSLLSFIHLY*DT* HPDT* 
HPDIQTPGHL/HTQTPDTRTPGHPDT 
ETPDTQTLRRLTPRHLDT*HSDTQT 
PDTQTPGHSTPKNL 


5625

A 

129 

323 

516 

AGGRFPSWDPFSSRGSQASKPVRMP 

PTR*MRR/RGRQPCPGHDRRTQLFA 

VSAPSRDLQNCSRERF 

129 

5626 

A 

130 

238 

583 

MADKQISLPAKLINGGIAGNLIGVTC 









v r riuuAK. j KiA^lNy vJNGQR V Y FSM 
SDCLIKTVRSQGYFGMYRGAAVNL 
TLVTPEKAIKL\AANDFFRHQLF*GR
AEA*PCLKRLLGGFWG 

130 

5627 

A 

131 

3 

492 

SSGLGCAGTRDSQLSIRRGLSSTRRS 

GGGGDGDGTPARDLQLGWLHLLH 

GSGDRRGIEECAIKRKDQGVNQKK 

JvKKJK i oKLOKMobCSN VCGSRQAQ 
AAAEGGYQRYGVRSYLHQFYEDCT 
ASI\WEHEDDFHILRSPTRRS/SYIFE 
GVDSFSGTLL*YLAWTG 

131

5628 

A 

132

t 
i 

94 <; 

Uru 1 OKbP W 1 P YS'EGDPRGRPRPR 
PLGPPP/TAHAADG SYRHSASGPGS 
WTSPFPSPGGGEKSGRTGQRVWKF 
GFWSWLCH 

132 

5629 

A 

133 

554 

1049 

GRTGGGLGLLHGHTRLADTDLLDR 

GMLKDTLAQAPPPPLGEAYCHQGP 

GPWAGGGALSPGTRLQAGIQG/P/PE 

r^LPyLrU'bPRP'PP/AQVVAGCGPA 

DLPPGGCPGCSGCSPHR*TAFIKTSA 

NPATLAGVGWG*GHPEGVPHTASE 

TGSDLQL*PTAIGHTGGPW 

133 

5630

A 




L)r VOK.OISI VELPGRIAHCFHCLPVLH 
VCLSLSVLCVCFVLFWCFSTSLF*RII 
VFERYLTFLVCVLCC*GLCFICTCF\F 
YCSLVF*LFASCFLYSS 

134 

5631 

A 

135 

71 

484 

EIFCYCVKYTYIQTHAPFKFFRFIYL 
FRDRVSL*PRLECCGVVLAHCNLR/ 

LPOPK*SSHLSLLSSWDYRRTPPMPS 
WFLCFS*RRGPHHVIQVGLELLGSS 
SLPAL ASQC WDYRREQPWPG *K VF 
LSSAYCLFHLTLY 

135 

5632 

A 

136

1 


bCjnrvjKPKQLDAPRSGl'DQPGQHG 
ETPSLLKIQKLAGHGGRRL*SQL\LE 
RLNRQENHLKPGGGGCSEPRSRHCIP 
AWVTERD 

136 

5633

A

137

LOJO 

1 Qf\A 

fyU4 

uGWlTRSGDRDPSLAKHGETPSLLK 
IYKKLAGRGGRSL*SQLLRRLRQEN 
GINPGGRACSKPRSHHCTPAWAT\G
DSASKK*KIKKKVV 

137 

5634 

A 

138 

421 

1155 

KICGSYYPLFLLATFSEESFQSMLIK 

TTLSLNVGLVLSWKRWQGASNGKL 

QGLSEFCESQGAQNLTLRALRLLHD 

LQIGEKKLLVKVDAKTKAQLDEWK 

AKfCKASVNGNARPRNCHLMTDEEA 

LVDEETKRRDQMIKGAIEVLTREYSS 

ELNAPSQESDSHPXRKKKKEKKEAIF 

D1?17\D\/ADT \1DVDI 11VCTMO A TC\JTBT\ 

DYIDLISREISIFRDTHKRSYGD*CK 
MKLSAWKVTRNRINWKKRK 

138 

5635 

A 

139 

338 

395 


139 

5636 

A 

140 

340 

1248 

RPLVLANCIQEVIKRIVDMQVPLISG 
MQ/AWFNIVKQINVIYHLNIMKDEN 
HIISIHGEKAFHKIOHPVIMFTl NKTP 

REGMYLNNTKTIHEMTTAEITSQGK 
WNAFPVGSHMMQE/CLSPLLFN/LIL 
AVLARAMK*/QKEIKLIEIRKKEVKL 









YLFVDDMIFCAENHKESTKILLELS 
NIFSGFAQYSISTKN*LNFHTFNNKQ 
L*KKILK/QLPLAGELK/RKKYLKIN 
VKKbbvr 1LKSTNIDAND*KHI*RH 

"xt wrcrz 1/ T\Jl\?v *7 \/XTxrD?v /f *cr> TTX7 
fNr i * WrUrUNl vK'L YNNrM'rRFN 

MISIKIPISFC*RNKKQAGKMAHVC
WPG 

140 

5637 

A 

141 

47 

411 


141 

5638 

A 

142 

1 

343 

GRLQAITDKRKIQEEISQKRLKIEED
KLI^QHLKKKALREKWLLDGISSG 
KEQEEMKK/RKSTKEEAILKKLKSIE 
RTTEDIIRSVKVEREERAEESIEDIYA 
NIPDLPKSYIPS 

142 

5639 

A 

143 

460 

976 

LLRIGKEAELGGRGRLPGHSQIKRK 

LQEEISQK/RV*KLGEDKLKHQHFE 

DK/VPLREKW\LPRWNPASGKEPGR 

D* RSQNQPRPSTQIPGS*NKVSLRLE 

KEIXQDLEKAELQISTKEEAILKKLKS 

IERTTEDHRSVKVEREERAEESIEDI 

YANIPDLPKSYIPSRLRKEIN 

143 

5640 

A 

144 

79 

533 

SSIMTFLESSA VPPH WTGQDGRVC 

WTGWIPQCQAGSAPE/RS*VFINSAG 

QKSADTGWSSSKPQN*HLSSFHQA 

VVGMIQPSHSQFLMKRKAASPRKL 

EWEH/LQPLHPMTLLYR*DGKPFR* 

VLLSTYTYCSSRDRPKSSGKNARRF 

FAHObS 

144 

5641 

C 

145 

354 

416 

MKESPGGELPQTGKKPVFLF* 

145 

5642 

A 

146 

3 

145 

SSSSSDFAGQTL*STQTVQN*FKKVL 
KPGRLYPVPIATMGIKEPLIS 

146

5643

A

147

214

464

FCGLLLLHPVSADF*PAELINTQEPQ
ERCQLDTGESSRVQHTLPSCPVQCG
GTAELSRNVMIGASELKCLHPSPKL
EYILPGN

147 

5644 

A 

148 

246 

730 

SSIMTFLESSA VPPHWTGQDGRVC 

WTGWIPQCQAGSAPE/RS*VFINSAG 

QKSADTGWSSSKPQN*QLSSTGAAL 

PLASLSRERAWWDDGKHRLTTPMT 

VPQRAVQQL*ETSG**DWRQKVQIF 

QQAVVGMIQPSHSQFLQREDVIML 

RPFGLHLSWEENGS 


5645

A 


to 

288 

FGGGYIPTWGKGEGILALELNHDIS 
REFCSAPALASRPPPTPPPLLPPT/PP 
LPAPRSPADATPRRVGGPLR*ALKP 
RAPGPGWSRRRCRSWW 


5646

A

152

1 A/C 

lOo 

344 

KQILLLPPRLEG/NGQNSG*WKFPLP 
GPSLFSCPSFQTSGNYGPPQQARAIF 
WKFKIKTGFHGVTREGLNFLTSGSA 
PLGS 

150 

5647 

A 

153 

38 

349 

RTAKSGSTKFSLNSKYGTVLAVLF 
MKKILVLRLSPKKNDQTVKYIKRPL 
TSLKIREIHIKTALLY YLTE\* KLLKF 
DDTCH**A\WRNYCWRVCVLIQPL 

VY fvV^lVI VY 

151 

5648 

A 

154 

220 

970 

ESRTRGAEAAGLAPSCTSPQAHGPA 
PLPTHVCCGVAIGMEPGHTAISPWV 
ELAVHLTGLVSSHDALGMMPSQQG 









*QWGRQRGLASGN*GRMSFPNSWP 

VTPICAARLPPGLLLICGFDGAGHSD 

RSEGF*GLRFPFCFKRQ/RSHSVSQA 

RV*WCDHGSLQPPSSGLK\HPPVSA 

SQVAGTTGMWHRAWLVCLFETES 

CSVAQARVQWRDLGSLQPLRPGFK 

QSSCLSLLSSWDYRHVAPCLASLFV 

CLRRS 

152 

5649 

A 

155 

193 

369 

HLN**FSNLIFFETESPSVT\RGIISAH 
RNPRLPGSSDSPTSASRVAGTTDT 

153 

5650 

A 

156 

626 

1017 

FDSCLFLFCFVCLRCASSVAQAGVK 

WHGLSSLQVPPPGFTPFSWLSLRSS 

WDYRHPSPHLANVFCFLGFFVFLVE 

RGF\TVLAR\IVSIS*PHDPPTPASQN 

AGITGVSHCAWPTLVCLNAKFSIVV 

FVHKD 

154 

5651 

A 

157 

1 

336 

TVSQAPSPESNPHGRRGDYHRKLIG 
QTFEWV/VRRHGGRAIGPRLSRVTK 
AAGARPPEPKDFGFPEAARRVMGIT 
PVLDLGRQPVRGALVELRGAHGWR 
AGGGTGSCGIPARL 

155

5652 

A 

158 

2 

320 

VVAVSQAPSPE/SEP*FPVTRGHHGR 

HGDYHRKLIGQTFEWV/VRRHGGR 

AIGPRLSRVTKAAGARPPAGAGEG/

LDRVGFDLINARIPPAKGANGSSPPR 

GACDRPEVI 

156 

5653 

C 

159 

177 

380 

MPTGADPLRGGDACIYQIKTNPVSP 
SPAPAGGRAPAALVTLDNLGPIARP 
PWRRRPIRTSAPINFRW* 

157 

5654 

C 

160 

1 

417 

MDATCHGCLQFQIMRNKKFQLLSP 
SSQHFRCMTASGGKQLLCRTGQKM
EHPIPXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXX

XXXXXXXXXXXXXXXXXXXGRPV 
* 

158 

5655 

C 

161 

1 

403 

MDATCHGCLQFQIMRNKKFQLLSP 

SSQHFRCMTASGGKQLLCRTGQKM 

EHPIPXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXX* 

159 

5656 

A 

162 

513 

1086 

QPQVASSYSAGQGRRWNTPSLGKIT 

RSGDRDHPG*HSETPSLLKIQ\KLAG 

CGGRHL*SQLLRRLRQENGVNLGG 

GGCSEPRLRHCTPAWATE*DSISKK 

REKKKKKERKKKKRKKKKWKKKE 

RGRGEAGEEQGEEEGERRRDKKKK 

EKKEREETREEGRRRRRRRKKKRR 

RRKKKEEERTTKRRRRTRKKK 

160 

5657 

A 

163 

2 

935 

WRRSTPAPSATSASPSRRCL*SQLLG 

RMRQENRLNLGGGGCSEPRSCHCT 

LASPAGTQSCSRCTSQQGVQSDIPC 

TAAAPETAPRRGSAGGTWCRRRAP 

P 

161 

5658 

A 

164 

34 

1026 

LLALGQSSCL*SQLLGRMRQENRLN 
LGGGGCSEPRSCHCTLASPAGTQSC 









SRCTSQQGVQSDIPCTAAAPETAPR 
RGSAGGTWCRRRAPP 

162 

5659 

A 

165 

45 

69 

RKNQRIYQIARKRLNEMARISPLRS 
MIILNVSGLNFPLKRCRLAEWTuSSP
DPIICCL/QKTHFTG/KDIYRLKIKGW
KKIFHANGSQ*QTAMMNTNRERTK 
GYTK 

163 

5660 

A 

166 

48 

72 

RKGQRISEIAIKRLNKMARISPLISIII
LNVSGLNFPLKRYRRAEWTMSSPDP
IICCL/QKTHFTG/KDIYRLKIKGWK
KIFHTNGSQ*RTAISQSAMCNNNRE

RVKGFPK 

164 

5661 

A 

167 

118 

639 

ATVPSQQLIFDKDSKAIQWRDT\LFN 

KWCY*IN*ISTCKKLDIDSYLAPR\T 

KINPKRILDLNVKPKTIKCLQENTGE

NCWDFGSGKHFLDMTPKMQSTK*Q 

ISKLIKI*NFSSKTQHFALLIIRIF*KTL

LTGSKYKATTWKK/VFVNHIPDKRL 

ISQIYQELFRTQXTKNPTSDW 

165

5662 

A 

169 

435 

808 

KNLCNNKKFHRDEGWAQCLTPVIP 
ALSEARSRLYHLR\SGVRN*PGQHG 
* FCHGLY WIMQNLAG\RGGTCL* SQL 
LGQLRQENSLNLQGGGCSEPRSRHC 
TPAWVTERDSVSl 1 1 IKIFTRMNLN 
R 

166 

5663 

A 

170 

167 

197 

VKFHKIKLDGEDTTYGGFDGPGLM 
Y\VYLISSDGH*FTQLHQEL 

167 

5664 

A 

171 

45 

259 

ARMNSKLALA\*ALQKRSLRHQSNV 
FSMFDQSQIQEFKEAFNMIDQNRDG 
FIDKEDLHDMLASLGELGQGQG 

168 

5665 

A 

172 

90 

468 

IMKLLT\RAGSFSRFYSLKVGP\KAK 
ATAAPAGAPPQPQDLEFTKLPKGW 
LIAPLENYPPG* *IG WFIKAGT*SEDF 

VTA T /"* r l " l''T TT T O' IT*/*** O 1 M'I't T f~\ A Clprm iwrr 

NALGTTHLLSTTCSVTTNGASSFTIT 
RGIESADGPLTVTASREYMDHTVE 

169 

5666 

B 

173 

89 

186 

XLKYFQTVTDYGKDLMEKVKSPEL 
QAEAKVLL* 

170 

5667 

B 

174 

85 

298 

XLEGALVRRQAKEPCVESLVSQYF 

QTVTDYGKDLMEKVKSPELQAEAK 

SYFEKSKEQLDTPDQEGWERELV* 

171 

5668 

C

175 

279 

533 

MAKDLMGEGPRTPELHAERQVFTF 
EKFKGSSLTPLDPRKAWERELGLTS 
LELIFRGNFGNHSLATPVESFPRTIW 
SFQTPGWAF* 

172 

5669 

C

176 

260 

389 

MDFFAQKKKKKVCMYVHMSTQR 
WLPNETNQNINVLGFLNFLSC* 

173 

5670 

A 

177 

84 

1008 

KVCCRYRKANGGKGSPVQEVPDG 

APEGAPLQQGP\PGWLPLPTTQSVS 

APPGPGESPTENQPMFKQTDPQMKS 

FWTKMGSPTLPSPNSV/AVSHFPSPH 

FISN*EWEQNQPLSLVLSGRGDELH 

SDGGQKTQGLDKQQLP\RGWHGLV 

SFGRAACSKLGKNLRPQEIKWSSKL 

LIT DTDCOr»P*PDl \ J /~*\JTJ f*\~\\T /"^ r^TSt r*o 

HLrJriloyC'orLVOVJbQwGGJvLCiS 
VGLLLQPKGGIPTALSPCALPAGHP 
TLPYGNNAGTDLRLHTEPEGPHGEP 
GLPARWGQDGMEPRWAAAGLGKG 









YLLQASRGVGSETGQVGFLFGKKT 
KSNRLIAVNDSVHFLL 

174

5671

A

178

79 

336 

NCCNTIKSISEKP/LANSIR*AKKQEG 
FFQISGIRQGCLLSSFLFIILEVLARG
NR*D/IKCIQIGKKKVKLSLFSETMR 
FNIWKRLR 

175 

5672 

A 

179 

3 

370 

SVCVRAHESVVKSEDFSLPAYMDR 

RDHPLPEVAHVKHLSASQKALKEK 

EKASWSSLSMDEKVELYRIKFKESF 

AEMNRGSNEWKTVVGGAMFFIGFT 

ALVIMWQKHYGL\ASKWDYEKNE 

WKK 

176 

5673 

A 

180 

24 

1173 

RAVAAGSGGRMLATRVFSLVGKRA 

ISTSVCVRAHESWKSEDFSLPAYM 

DRRDHPLPEVAHVKHLSASQKALK 

EKEKASWSSLSMDEKVE/LWVLKG 

PTGAPSSSRKRVCDRASAHFWAYC 

LESSLAQEGCSAGVSGHCARAPVY 

VLTSHLALPADRIFC*APFSVLSGGS 

LSAYLLGKT*LTVNY*KKLITMHSV 

WDERGRKITGLNRP*YCNSSKK*FC 

SFNLHLKRTVCIFFLPCPVTCLRGHV 

CARMCVNMMWPGLVYPSALCFLL 

HKCGFGEKWLNVAEEGAADLCAC 

KWLSSLPPVYRIKFKESFAEMNRGS 

NEWKTVVGGAMFFIGFTALVIMWQ 

RHYVYGPLPQSFDKEWVAKQTKR 

MLDMKVNPIQGLASKWDRV 

177 

5674 

A 

181 

1 

738 

RRSQRYPFPLHGDLRAAGCGRSLPR 

SRGAPRRGLALFRSRDTGCRGRSRQ 

GSGGRMLAYQGYFTLVGKRAI\STS 

V\CVRA\HESVVKSEALFASQPYMH 

RR*HHPCPE\VAHVK\HLSCQPEGT* 

KEKEKAFLEASLSM\DEKVEVVFAL 

KFKE\SFA*RLNKGAPNRVGKDRFV 

WAGAIVSFNRVFTALRLSCWQK\H 

YV\YGPLPRKSF*QKSGLAKQTQEG 

CLDNEGEPPSQGLASK\W\PYEKNE\ 

WKK 

178 

5675 

A 

182 

82 

395 

ICSFAPSSIFWGSAFTGTCSSTSVRA

AAPPGTPQRPSMDAHMTGRKGRLS 

*TSFFTWSMVTALLGVWTSVSVVW

FDLADYDD*L*ALAIYD\ADGDVRF 

LRGLSH 

179 

5676 

A 

183 

134 

594 

VITLTIVSPALVANNSARGLTLPAP/P

LPTGSRRTEGNPSWEPGDLGSSLASC 

*NPPGAPGPKS*SQTGRPALPALASR 

LSGPLLQLPCFLSVPRSPERAPGPRH 

KLLLLQSLMAVSFISQFKCHLPGEV 

LPDRAAPGGSWPGDSRALTKSPPCT 

180 

5677 

A 

184 

3 

404 


181 

5678 

A 

185 

2 

851 

AAAPAPAPAPTPTPEEGPDAGWGD 
RIPLEILVQIFGLLVAADGPMPFLGR 
AARVCRRWQEAASQPALWHTVTL 

SSPLVGRPAKGGWKAEKKLLASLE 
WLMPNRFSQLQRLTLIHWKSQVHP 
VLKJLVGECCPRLTFLKLSGCHGVTA 
DALVMLAKACCQLHSLDLQHSMV 









ESTAVVSFLEEAGSRMRKLWLTYSS 
vjl lAlLuDLLGSCCPQLQVLEVSTGI 
NRNSIPLQLPVEALQKGCPQLQVLR 
LLNLMWLPKPPGRGVAPGPGFPSLE 
ELCLASSTCNFVS 

182 

5679 

A 

186 

2 

568 

EFGKDSCGNISAALPPLSAQVFTAPE 

ADPHPLEVSGTPRVEGESSRL*LHIT 

CDTLGLVSTLGSGSGTLGAQRCSVG 

MSACLPGSLFLLFPPAGRYQRRGHP 

SKPGMGRKEVTAKAVRVGLAPATL 

SVSLVDLSLSSPNPSCPSVSPQLVGE 

CCPRLTFLKLSG\CHGVTCLTLWSC 

LAKACCQFHKPW 

183 

5680

A 

187 

2 

333 

ARDSTSTTEMNPQVLFQRV**QFLLI 
TTSWRKVISQTFGRLVDTGSKL/TV 
QMPRISSPSVRVAACGGQVIDGVLL 
KVQLTVDP*T*WTDLVIFS/SAFE*VI
GIDILGSECS 

184 

5681 

A 

188 

2 

363 

AREVFTQHS\HLTYH*TIHTGEKPYK

CIECGTAFGVRSCLSIHLVVHTG*LP 

YRCHECGMVFMRNTHLVRHQLIHT 

GEKPYMCNECGRAFIAHSNLATHQ 

AIHTGEKPYICTECGTVFTQN 

185 

5682 

A 

189 

361 

1026 

RKYLPPRPTFNAEALPLKVRIWGRG
LISKLYH*LYQEL*L*LYQGLITILLE
KKLI*KLDKNLNRHFSKEDIQMANR
HMKMYSTSLIMREMQIKTTMRYPSP
PQLKYLLSQKTGNNKC*RGCGEKG

T*f l Mini Mi Jt/ott i >/\t\f iimnmnimtri / 

TLVH/WWKCILVQPLWRTVWRYL/ 
RKLKIELPYNPAIPLVGIYPKERKSV
Y*R*ICSMFTVALLAIAKIWKQSKCP 

bADE W INK! WYA YTTEY YSA IK 

186

5683

A

190

1 CO 

ICC 

366 

FlISMNFVFLYFVFDLSrNEILLGLKE 
WSIYLSS/DHSLSSLCSFYLLLLMFFL 
CMLLLLLLCSSIIIS*P

187 

5684 

A 

191 

10 

284 


188 

5685 

A 

192 

3 

438 

LFISLLSISEKIIENCWV*LSAARS*A 

LRKLAFF*ATRSFF*ARDILGRFHLF 

F/CNFFLGLLFIDWILSYSSMSFLIHL

LHPAGQQQASTICCSIICQANLHTIF 

WQFVCIRCADYHIPLYTGISNLTNDI

SVCHTNYHPVIGVW 

189

5686

A 

193 

4y/ 

752 

T-\/'">'\ FY f T T T\l~» T r'nXIP A ¥¥ AT TT1 "VT1 TY T T>/ 

DGVLLLLPRLECNSAILAHRNLRLP/ 
GFKRFSCLTLLSPWDYRHLPPRLAIF 
FVFLVYVGFHHVGYAGLE\LLTSR* 
SARPRPPKIA 

190

5687

B 

194 

922 

2057 

YPNRFPLVMDSEKQRNFNAESTIGS 

HIHGPRIVAGLHAPTLMEEDEDALQ 

ETVRASIRKEQRNSRHDGGDGIRKA 

HAAIPRESRSMKRSPRKEVKKKRW 

NRPKMSLAQKKDRVAQKKASFLRA 

QERAAES* 

191 

5688 

A 

195 

1492 

1790 

SQTLGGRGGQITKSGDREHPG*HSE 
1 roLJLWyivvLAGHUNUUL WM^LIKK. 
LRQENDMNPGGRGCSEPRSCHCTP 
AWVTEQDSISKKKKQKQKEGLGGS
A 



192 

5689 

A 

196 

178 

572 

QAGSCTRTSQPRDSRGSDIQPVGLA 

FGRTPAELQELHLSSPRPGRGAVWA 

CUbLbPvjPLPLLSl lTSGSQPSLQLSSL 

PQSPLFCPLPPF/PPPRPPPRVGLVPPP 

*LTHVPGLQPTGRPPPSPSRSPPAPPP 

Q 

193 

5690 

A 

197 

209 

684 

PWDCVHACLRGGWHSANRGHFRI

GGPGRPKAPFLPFPASLKVQALIPYP

GVHPGRPLHPCVPRRMQRLCGTRD 

PEKLASCDIVVDVGGEYDPSXRHRY 

DHHQRSFTETMSSL/DPLGSRGKTK 

LSSAGLIYLHFGAQWLAQLLGTSEE 

DSMVGTLYDKMY 

194 

5691 

A 

198 

2 

720 

IPGCMIRHELLPPCRELLMGHRFLR 

GLLTLLLPPPPLYTRHRMLGPESVPP

PKRSRSKLMAPPRIGTHNGTFHCDE 

ALACALLRLLPEYRDAEIVRTRDPE 

KLR\SCDIWNVGGEYDPS\RHRYD 

HPQRSFTETMSSLSPGKPWQTKLSS 

AGLIYLHFGHKLWAQLLGTSEEDS 

MVG\TLYDKMYENFVEEVDAVDN 

GISQWAEGEPRYALTTTLSARDARL 

NPTWNHPDQDTEAGFKRA 

195 

5692 

A 

199 

209 

684 

PWDCVHACLRGGWHSANRGHFRI 

GGPGRPKAPFLPFPASLKVQALIPYP 

GVHPGRPLHPCVPRRMQRLCGTRD 

PEKLASCDJTVVDVGGEYDPS\RHRY 

DHHQRSFTETMSSL/DPLGSRGKTK 

LSSAGLIYLHFGAQWLAQLLGTSEE 

DSMVGTLYDKMY 

196 

5693 

A 

200 

2 

720 

IPGCMIRHELLPPCRELLMGHRFLR 

GLLTLLLPPPPLYTRHRMLGPESVPP 

PKRSRSKLMAPPRIGTHNGTFHCDE 

ALACALLRLLPEYRDAEIVRTRDPE 

KLR\SCDIVVNVGGEYDPS\RHRYD 

HPQRSFTETMSSLSPGKPWQTKLSS 

AGLIYLHFGHKLWAQLLGTSEEDS 

MVG\TLYDKMYENFVEEVDAVDN

GISQWAEGEPRYALTTTLSARDARL 

NPTWNHPDQDTEAGFKRA 

197 

5694 

A 

201 

94 

660 

LHLKNSDGYCLIVYQKRFIPVTFIHF 

CFLILSLKFNNIIPLNIFANGEKYFVY 

KFTYSY\YWKFLTC\FVELPVNCLFI 

SFSHFFLMSFVIFL* * ILGMLY VLVL 

LIFNFTYICIVIAYF*LFVVIQTFLHFY 

LLKFINLFL*SFSGFCVLLRRVIPIPRI 

YICFIRILYNTSITLFSTYLEE*FSFDM 

198 

5695 

A 

202 

3 

347 

FFEMEF/SLLLPRLECNGVILVHCNL 
RLPGSNDSPASAS*VAEIIGVCTASS 
*IFVFVGTLTQ*KSRLVDQAGLELL\ 
PASSDPPILTSQSAGITGVTTDIQPPF 
FLSSFANTEWT 

199 

5696 

A 

203 

32 

403 

APIPDAMGHFTEEDKATITSLWGKV 
NVEDAGGETLGKLLVVYPWTQRFF 

UKr UInLooAoAIMOKK VKAPU*NV 
\LTSLGDALMHLDDLKAPLANLRER 
T/CDQGCWVNPENF*LLGNVLVTVL 
AI 




200 


5697 


204 


94 


361 


FCQLDSLYTESQSL\DSTVL*LAEHM 
KFIKTSDY*GALDTFTKHLQMSVDA 
YE**MISILNPSSLSERQSLLLFIVLD 
LSLVPYLLIFEF 


201 


5698 


205 


265 


408 


MTLSCSNLVFFFLFKITVFIMTMVTP
QCKGGPDSVCFSTLFVNKCPV* 


202 


5699 


206 


10 


419 


MRGGHSWARKGGMRGLIRNERES 
GGGEQTD*ASKLKRGNSNRITPFAY
MDTYNASSSSSSSSSSSSSSSSASKLE 
AELGQTGLLPIPLGGGGGAFSTKTV
RSGESEGGLWKQRKRWSLEGRCGR 
VSGWECGGAEAMK 


203 


5700 


207 


165 


248 


MDTYXXXXXXXXXXXXXXXXXX 
FQARS* 


204 


5701 


208 


337 


428 


MILRVDDFVPLALLPQSFPHRSHYD 
PNPAA* 


205 


5702 


209 


531 


1860


PSKPPNQCFLSLSQATSAGTHLSQD 

TESLTQVAKGIS* GSQGHGGGTL*M 

RGGHSWARKGG/H*EGSSGMRGRA 

VEGNKQTRLLN*NGEIQTELPLLHT 

WTLTVQMRKVTPREGELSCPRASK 

LEAELGQTGLLPIPLGGGGGALSTK 

TVRSG\GVRGVFGSRENDGLWKVD 

VEGSVAGSRAGA*AMKGEPKQIPK 

LTL*S*P*ENPNGNAVSFS*ARGKL*/ 

SFTKTLAGPAGAPAPAPPP\GPRWPP 

PA/DCGHTRPPLPSESLEAELKAGDS 

PSLALDSLSP*PPPTPPAGPRRSQGPP 

GAPAGALGSRCPRQQVKQTTLGS* 

RGRAGAGNTRRRGSGPHAAPIGSV 

DLRSGAPATAGPCG\RAASVGAGPR 

RGRGGRGVLPAPPWGT*GAPKGPRR 

RGPAGWSQTGSARPCGPWASRGGP 

KPRPCVHGGRRPGDAPGVVTAPRC 

GR 


206 


5703 


210 


32 


452 


207 


5704 


211 


38 


618 


APSPDAMGHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSNPSAIMGNPKVKAHGK 

KVLTSLG\DAIKHLG*SQRAPFAQA* 

SELALVTKLHVNDPGGTFKLLGEML 

LVTRFWAIPFSAKEFHPWRLQA\SW 

QKQKMAEDGDLELASALVPSRLPL 

SSLAHECRAFQGYGFILASNYK 


208 


5705 


212 


137 


368 


DGVYLWTHRPYCGLGSLNFGSVUV 
LP*VKAYGWMVLTSLGDAIQPLAD 
PECSFXGQLRELRCDMLHVDPEDFR 
LLGK 


209 


5706 


213 


60 


317 


FPCLVCCTLQENSGKPILCPRRTTAQ 
LGPRRNPAWSLQAGRLFSTQTAED 
KEEHLHSIISSS*SVQDYT\SKHKFQA 
STYKH*SIA 


210 


5707 


214 


406 


HEDKLCTVATLRETYGEMADCCAK 

HEPERNECFLQHTDYNANLSRLMR 

PEEDVMCTAFHDNEETFLKKYLYDI

ARRHPYFYDPELLIFANRHKAAFTD 

CSQAGD*AAWLVPKLDDYLYEL*A 


TSCIIISKCANL 

211 

5708

A 

215 

1 

2953 

MKWVTFISLLFLFSSAYSRGVFRRT 

PLGPASSLPQSFLLKCLEQVRKIQGD 

GAALQEKLCATYKLCHPEELVLLG 

HSLGIPWAPLSSCPSQALQLAGCLS 

QLHSGLFLYQGLLQALEGISPELGPT 

LDTLQLDVADFATTIWQQMEELGM 

APALQPTQGAMPAFASAFQRRAGG 

VLVASHLQSFLEVSYRVLRHLAQPG 

GGGDAHKSEVAHRFKDLG\EEDFT 

ALVLIAFAQYLQQ*PFEDHVKLANE 

ATEFAKTCVADESAXENCDKSLHTL 

FG\DKLCTVATL\RETYG\EMADC\C 

AKQGT*GEMECFFATQRMDNPNLP 

PIGWRTRGWMWMLHCFFHDNEGD

IF*KKYLLWKLPGRTSFTFYGPRELL

FLWLKR/RIKAGFLQEC\CQGWLD*S 

WPACLAKGSDELSGMKGKASXSAK 

QRLKCASLQKIWEKELSKPWAVAR 

LSQRFPKAEFAEVSKLVTDLTKVHT 

ECCHG\DLLECADDRA\DLA\KYICE\ 

NQDSISSKLKECC\EKPLLE*FH\CLA

EVENDEMP\ADLPSLAADF\VEN\KD 

VXCKNYAEAKDVFLGMFLYEYARR 

HPDYSVVLLLRLAKTYETTLEKCCA 

AADPHECYAKVFDEFKPLVEEPQN 

LIKQNCELFEQLGEYKFQNALLVRY 

TKKVPQVSTPTLVEVSRNLGKVGS 

KCCKHPEAKRMPCAEDYLSVVLNQ 

LCVLHEKTPVSDRVTKCCTESLVNR 

RPCFSALEVDETYVPKEFNAETFTF 

HADICTLSEKERQIKKQTALVELVK 

HKPKATKEQLKAVMDDFAAFVEK 

CCKADDKETCFAEEGKKLVAASQA 

ALGLTPLGPASSLPQSFLLKCLEQV 

RKIQGDGAALQEKLCATYKLCHPE 

ELVLLGHSLGIPWAPLSSCPSQALQ 

LAGCLSQLHSGLFLYQGLLQALEGI 

SPELGPTLDTLQLDVADFATTIWQQ

MEELGMAPALQPTQGAMPAFASAF 

QRRAGGVLVASHLQSFLEVSYRVL 

RHI A HP 

212 

5709 

A 

216 

1060 

1259 

TKFGQHGKTPSLLKPKLAGHGGAH 

L\KSQLPGRHENHLNPGGGGCSEPR 

LCHCTPAWVTKRDCLKK 

213 

5710 

A 

217 

2 

354 

SAAAGQGEENQLEASLDALLSQVA 

T^I l^"KJQl /CP COT PkVrSPP \/\l/T> A 1~M T XT 

DRDEDLMRQTEGRVPVFSHEWPD 
HLRTKPDPEVEEQEKQLTTV 

214 

5711 

A 

218 

90 

329 


215 

5712 

A 

219 

2 

632 

QPSFLCVILVYLGDQPVPIGAEKRRS 

TLEASLDALLSQVA*SEELSGEFHL 

QVG\DEYGRLTWPSVLDSICLAFLD 

SMNTLNKVLKHEKTPAVP*PGHHSS

GCCLQDRR*KISCRQT*KDGCLFSA 

H*GKSLDHLEKPSLDP*KLEEQEKQ 

LTTDCSPAFGADAAQKQIQSFE*NV 


FQTFLGENQQRGSEDSESWRFSGRT 
QRI 1 TPTHT 

OrVL>JLf 111 LJ 1 

216 

5713 

C 

220 

309 

479 

MIHYSSSYSFKSSRELHIKFKFPVST 
SCGAFGSKIKWKVLSEWEETQESE 
QPEVL* 

217 

5714 

A 

221 

76 

525 

PQPLGPQPQQRPSRLASCCGAAAPC 
SWVEGAIGHAPPHGLPIMSNGYRTL 
SQHLNDLKKENFSLKLRIYFLEERM 

v!y^ I XiAoJKAcLU I JVKN 1 K V c» V Co 

LKRELQDKKQHLDKTWADVENLN 
SQNEAELRPQFEERQQETEHVYELL 
EK 

218 

5715 

A 

222 

534 

1310 

PRNEFTQQFCFIDSFFLVTLKIEALQ 

CSHRSRRSGEKVPFVQTYSLRAFEK 

PPQVQTQALRDFEKHLNDIAKKENF

SLNVRIYFLEERMQQKYEASREDIY 

KRXNTELKVEVESLKRELQDKKQPS 

GLKPWA\DVENLQPVQNEAELRRQ 

FEEP\QQE\TEHVYEL\LENKMQLLA 

KKJNc>KJ-.A 1 fc. IMKvjWl^LLVhKyKK. 

GV*TWKLSGET*RESPKNWGRCPR 

EPQVKPDPLHLRPLAQKGKDLKKI

MLGSPNHIKNASDQ 

219 

5716 

A 

223 

32 

360 

TGSKIRNIKGIHIGREEMKLILFTNYI 

I t?/KTDI/'r\ rfd/ - f TAT TCD V* A T\T A 

1* V t^Kc/NrKJMr KlyJLAJLloK. Y *A 1 V A 
GCNIYIPPTPKLNFDIVG*ILLAKKLF 
TNANNNIRYLGINLIINDGHHLSKEI 
YIISL 

220 

5717 

A 

224 

2 

761 

APTPTGQRVVRATPAQSAPVRLRRR 

SYDVNNPIPSNLKSEAKKAAKILRE 

FT\EITSRNGPDKIMPGSTVIAKAK\G

LANSCLLNQSPGSLVTFQRGGPGVL 

VARL\PDGK\WSSPFS/ALGIAGFG\G 

GFEIGI*GIQTLVIILEF/DDPCC*EAF 

AKGGNLTLGGNLTVAVGPLGRNLE 

UIN V\ALKooAAVr 1 YCKoKUJLrAO V 

SLEGSCLIERKETNRKSVQVKVILIE

SVMRK*YFKS*YNLQSTFIYSFYNM 

WF 


221

5718

A

225

900 


CHUC^DT TY^ATMJT I/"C*1//~fcriT>T}/~ , /-»T_I 

ov^rir OKL. 1 AUHJL.K.O* VyUKrOQ H 
GEIPSLQKIQKLAGHGGASL*SQLLG 
RLRQENHLNPGGGGCSEP\RTPGWA 
TE*DSV 

222 

5719 

A 

226 

198 

660 

LLLALLFNTVLRFTVCLFLFQAPILK 
SPCCSAARVDRRKSIWVDGL*ICSR 

L»oK. V LiKJ x r J\r V V VJIIj \J * * JLtoN 

f *L/HLNIEKN*GLTG * VSILCKCLFYH 
SL*PLL*VKCSLPJ>GWTHTCNLSTL 
GGRGGRIT*VQEFETSLGNIVRHRI

223 

5720 

A 

227 

1 

347 

GERLAGRRRKMAVESRVTREEIGN 

T^C*T/"DTr^DT71/''T/^ , D7 CT T> \ A lTTTTirXT/^ T> 

JUo*JKJ'lL'Kfcl^iCr'LoLK.\Ar i 1NNOK 

HHRMDDFSRGNVPYSELQTYTWM 
DAT\LKDLTSLAQELYPQATLNGTH 
FTF A V ALTH ATPPfiSR VND 

224 

5721 

A 

228 

3 

225 

SCQGERLAGRWRKMAVESRVXTQE 
EIKKEPEKPIDREKTCPLLLRVFTTN
NGRHHRMDEFSRGRWSKAPGKQK 


GP 

225 

5722 

A 

229 

1984 

2676 


226 

5723 

A 

230 

2 

590 

GCRNSARGKMAVESRVTQEEI\KKE 

PEKPIDREKTCPLLLRVFRTNNG\R\ 

HHRNGRVSPRGNVPSTELQIFPLGW 

MPTLKELTSLWKEVYPEARKKGTH 

F\NFAIVFNRCLKVPGYR*FSFLQS* 

GGLASTHVWAERGLDDSHEPLQSR 

KF\QIGDYL\DIA\ITPPNR\APPPSG\R

MRPYLNSNFTYLFEFYFFPSVM 

227 

5724 

A 

231 

1 

291 


228 

5725 

A 

232 

3 

320 

AKNRLQILKFCLHFKERKTVLPSKH 
AVPEVIEDFLCNFLIKMGMTRTLDC 
L/QASEWYELIQKGVTELRTVGNVP 
DVYTQIMLLENENKNLKKDLKHYK 
QAAEYVIF 

229 

5726 

A 

233 

209 

461 


230 

5727 

A 

234 

104 

609 

RQPGTRGTRRTRWRLEGAYYLEQV 

TITEASEDDYEYEEVTC*F*IPDDNF

SIPEGEEDLAKAIQMAQEQATDTEIL

ERKTVLPSKHAVP\*VIEDFL/RCNFL 

DQNGELTRTLDCFQSEWYELIQKG 

VTELITVGNVPDVYTQIMLLENENK 

NL*KDLKHYKQAAEYVIF 

231 

5728 

A 

235 

222 

502

TSLIKHYISNLFTFINSVEYKQ*WFL 
LWLCVSLKC*LGQAWWAQACNLS 
TL*GPRWAADHLRSEVRDRTG\QH 
GETPSL/LKNTKISWAW*WVPV 

232 

5729 

A 

236 

565 

779 

APGVRD*PGQHGENLSLQK*KLKK 
LAGHGGIHLCFQLLRRPRQKYRLSP
EGQDCSE^MVCTLAWATEQDPVS 

233 

5730 

A 

238 

656 

923 

VPVHRGKERGGIQDLDEIATPTLLS 
KSSSFFKTSYCTDFFLFLTESCCVTVR 
LECSGMISAHCSLCLPGSSNSAPTSP 
VSHNKDRLLLHL 

234 

5731 

A 

240 

171 

373 

AWLCANKTLFLNFYLFFETRSC/SLS 

RLECNNAHAHCSLLLPGPSDSPTSA 

SQVAGTTRTCHDTQPI 

235 

5732 

A 

241 

915 

1283 

QRQGRLGLWDNEEGEIGTKYSSFKI 
DTVEKLFLGGGRSRVKPRGSNKAR 
DPPSFPSPAWEVGPQLGVPLKSPCG 
LHLGLAAVPLYDPRGGGPHTPPHTP 
P/PTPHHPPHPHPPHTKHTPPTNTQ 

236 

5733 

A 

242 

555 

767 

NKKDLFSLRSGDQKSKVKTSEGPRL 
/PLRGIRENP/CPPVPAPGGPRHCLAC 
GGITPVSACIITRISCPLYSN 

237 

5734 

A 

243 

2 

744 

GTMAVFVVLLALVAGVLGNEFTIL 

KSPGSVVFRNGNWPIPGERIPNVAA

LSMGFSVKEDLSWPGLAVGNLFHR 

PRATVMVMVKGVNKLALPPGSVIS 

YPLENAVPFILDSVANSIHSLFSEETP 

VVLQLAPTEERVYMVRKANSLFEY 

LSITFLQLHNRLFQKNSVLTSLPLTS

I xrxrK.rk.rt7F ot t r?T7/o *t f\r\T ri*ro\tn o 

LNNNNbLHLLFF/S*LQPLH*ISNFLS 
CDKHFTQKMIVLINNHSNLPMLPTK 
FGNPFLTKSFPSFPNLSLKPFSA 

238 

5735 

B 

244 

385 

544 

MTGSPEDDETGYPLRSPGQERSSFT 


EKPMDNAATSGIRSPGIGQFPFRKTT 
DPX* 

239 

5736 

A 

245 

1 

449 

GNEFSILKSPGSVVFRNGNWPIPGER 

IPDVAALSMGFSVKEDLSWPGLAV 

GNLFHRPRATVMVMVKGVNKLAL

PPG\SVISYPL\ENVDLLFLSELQVLH 

DISSLLSRHK\HLAKGSILPDLYSL\E 

RAGFGMEIGKRYGEDSEQFRDASKI

240 

5737 

A 

246 

2 

1230 

GAGRVRARHLLTLRLSPCPAGPFRV 

APQCCGRRGTMAVFWLLALVAG 

VLGNEFSILKSP\GSWFR\NGNWPIP 

RE\RDPPDVAAI/SPMGFSVKEDLSW 

PG\LAVGNLFHRPRATVMGDG*RG 

VNKTWLYPQGSVI\SYPLE\NAVPFS

LDQCLQIPIHFLIFLEETSCLFLQLGF 

PVRE\RVVLWLGKANFSV*RTFSVT

L\RQLR/NIRLVFKENSVSSVSLPLNS 

LSRNNEVDLLFLSELQV\LHDISSLL 

SRHKHLAK\DHSPDLYFTGSWAGL\ 

DEIG\KALLGEDSEQFRDASKILVD\ 

ALQKF\ADDMYSLYGG\NAVV\ELV 

TVQSF\DTSL\IREGQGTYSLEGKTSA 

GTPASPYNLAYKYNFEYSVVFNMV 

LWIMI\ALALA\VIITSYNIWNMDPG

YDSIIYRMTNQKIRMD 

241 

5738 

A 

247 

1547 

1965 

AQGRFQALCSLVAVRAWGWPLSG 

NSFSCGNSQCVTKVNRSVTTRRTAP 

MGPTRRMRVWLAASWRMAGRIVG 

GMEASPGSFRGKPAFERTRSTSVGR 

HHQRQPLRS*NHRFQDPTKWVAYV 

VRPTSAARRPAPCGPSKKA 

242 

5739 

A 

248 

403 

734 

MAVQAGTQCLVQQLHSGFLQHLW 
LDHCRPRKMLTEVLLEVAPA*DQA
LLAGWEDVCGSREAHGLD\GRPKG 
RGLVSSSTATSKSAVSALYRGCLTI 
WTTWARTVLASEPLR 

243 

5740 

A 

249 

1 

552 

MVWSSQRCCRKHCGAAGPGTVCQ 

LVRPLLTDRMVCAGYLDGKVDPAR 

PQKNTDTSVSNAGRFTDIWMPVLE 

EFKAVGIERQNVGPGLNGEAHPGR 

GRVRSCLREVPWQVSLKEGSRHFC 

EQLWWGTAGCCLPPTASPVSGIKA 

L/YESELADARRVLDETARERARLQ 

IEIGKLRAELDEVNKR 

244 

5741 

A 

250 

63 

497 

LPDVEKLGRRRGRKMDSVEKGAAR 

LR\PNPRGRPSRGRPPKLQRNSRGG 

QGRGVEKPPHLAALILARGGSK\GIP 

¥ TS"K T T TV TFT A /"^ T fit T T ,/"*"* f I M /T T% A A Y ITSn/t 

LKNIKHLAGVPLIGWVLRAALADSG 
AFQRCACARVGGAAWAGVGRGSR 
AAGGAGASGATALGRGPSLMPGM 
C 

245 

5742 

A 

251 

1 

349 

GTRAVVCGRRLISVREQIRHFVMRP 
EINTNHLDKQQVQLLAEMCILIDEL 
DNVQAYCETKKNCHLNENIEKGAAL 

JvVJ 1 LLJLoULUK^rKr Ablvo 1 LrisJiV 
QTSVIPYFLVGSSSFK 

246 

5743 

A 

252 

2 

423 

LRWSL/DSVAQAGVQWGDLGSLQA
PPPGFTPFSCLRLPSSWDYRCQPPRT 


KKUr 1 V LAKM V SIS* PCDPPAS ASQ 
SAGIIGM/SHRARPGFPT/CQTTQEPG 
GTTSHGYRIPPP*QDLC*LPQFPERG 
SGSQRC*DKPGSPSL 

247 

5744 

A 

253 

891 

1564 

SPRALAANPWWMVTSVSSRVKQSC 

TQQGGFVPLAQQVHSPIISELFYALV 

SLFFYFLFDICRARILSGSFCILRTLL 

LLLFLRRSLNSVTQAGVQWRDLGS 

LQAPPPGFTPFSRLSLPSSWDYRRLP 

KKJrANr Vr VrLVb 1 GrHRDETRlVSI 

S\GPRDPPASASQSAGITGVNHRAW 

PTFCIFCRDRVSSCWPGWSRSHTPG 

LKRSSCLSLPKFWDYRHKLPYP 

248 

5745 

A 

254 

6 

338 

MEPSCGLGSEALALTQTWAGSHSL 

T/" \/T?T_T'T'C' \/OD n/ - * T> /"* T&T"> T" T"\/"l / /-"« x r-t rr\rv 

K.YrH I SVSRPGRGEPRFlYVGYVDD 
1 \IL VKl-L»lNU/ArorKJVlVJr^KArWlEH 
EGSKIWDRETHIAKDTRQIFRVNLR 
TLRSYYDQIEAGD 

249 

5746 

A 

255 

2 

424 


250 

5747

A 

256 

25 

486 

EFHRLRENPPWCLSPADKTNVKA\A 
WG\KVGAHAVRSMCAEALERMFLS 
FPTTKTYFPHFDLSHG\SAQV*GATG 

A T"\ A T TTwT A WAT T\ 7\ r"\T"Vfc /fTYVT\ A T f* 

KKV\ADALTNAVAHV\DDMPN\ALS 
AL\SDLHAHKL\RVDPVQLSSS*SHC 
LLO^rWPAHLrRPSr rPGuCTPSLG 
v^voWArt 

251 

5748 

A 

257

230 

358 

FLIILRRSLILSPRLECNGSVPAHCSL/
RTPGFKRFSCLSLSSS 

252

5749

A

258


loo 


253 

5750 

A 

259 

340 

535 

FRFKALFDFLFLVEIASCCVAQAGV 
QWCDLSSVQPPPPGNSSDSPTSASQI 
AGTRGALQHAWLIF 

254

5751 

A 

260 

1618 

1962 

DRVSLLSPRLECSGTIL\AHCKLR/LP 
GrTLFSCLSL\PSSWDYRRLPPRPAN 
r r Vr L VvbMGr HR VSQ/AMGLDLLT 
SGDPPASGLSKCWGLQGVSNLRPS 
QASPSFKGIKGPQTLRA 

255 

5752 

A 

261 

3 

395 


256 

5753 

A 

262 

152 

514 

LATLLGPWSCARVPSVPALLTPPPL 
AGPPPPQPLLQRLCSGPRLLLLSLGL 
bJ_,l_.l_L. V VUCVIGoV^NoyH^bbLKGL 

t> tjtpcxiitvt'a cvx7nx>ncj> at ctyvd a 
KJc J r i>INr \ i Ao I curuoJvALd J yOKA 

MWGRKMEVRLEFPVWRKQQ 

257 

5754 

A 

263 

138 

1072 


258 

5755 

A 

264 

1 

488 


259 

5756 

A 

265 

1 

2105 

FRAASCAPPSWRMELRSGSVGSQA 

VARRMDGDSRDGGGGKDATGSED 

YENLPTSASVSTHMTAGAMAGILE 

HSVMYPVDSVKTRMQSLSPSSQSPV 

VPSIYGALKKIMRTEGFWRPLRGVN 

VMIMGAGPAHAMYFACYENMKRT 

LNDVFHHQGNSHLANGIAGSMATL 

LHDAVMNPAEVVKQRLQMYNSQH 

R^AT^PTRTVWPTPnf nAPVRQVTT 

IVOAVlO^Hxl V VY lx 1 X_»VJL(Vj/\r X i\0 I 1 1 

QLTMNIPFQSIHFITYEFLQEQVNPH 
RTYNPHSHIISGGLAGALAAAATTP 
LDVCKTLLNTQENVALSLANISGRL 


VVPMGPLLPNALERGGDGTAAHRK 

AVCGDIREVWELDRLLPCDIRDGAF 

ITMPFHCYAQNRGEGLLRPAELAD 

GAAPRELGQPGGGPEDGWGQPRW 

RRRQGPPPGREDYENLPTSASVSTH 

MTAGAMAGILEHSVMYPVDSVKPR 

ARPRLLAALRRGRRSGEHRWLRRR 

LGSRGTRSLKLCTVLPRWPFGLAGA 

AHTCAVSEGVPRRGSPHHAGAEKR 

VALARPRALGTWCVAAAPRVISGT 

WGRQVFSRLVAALYRFDSGPWDPL 

SEGSCTSSPDFGSPSRREAMTFAFSF 

CLRGGRHMPSLREHYWARMSHER 

HKDWANVGGTfTVf SFPNFI INNTR 

LARNRTPWARHDNWCHHWOHVSP 

ESSLDCVRLQGLPWMAAAEVEMK 

LPAGHMHMPVSFPNRSPLGAGCIN 

260 

5757 

A 

266 

882 

1299 


261 

5758 

A 

267 

1 

2607 

MAFAWWPCLILALLSSLAASGFPRS 

PFRLLGVANGIEVYSTKINSKVTSRF 

AHNVVTMRAVNRADTAKEVSFDV 

ELPKTAFITNFTLTIDGVTYPGNVKE 

KEVAKKQYEKAVSQGKTAGLVKA 

SGRKLEKFTVSVNVAAGSKVTFELT 

YEELLKRHKGKYEMYLKVQPKQL 

VKHFEIEVDIFEPQGISMLDAEASFIT 

NDLLGSALTKSFSGKKGHVSFKPSL 

DQQRSCPTCTDSLLNGDFTITYDVN 

RESPGNVQIVNGYFVHFFAPQGLPV 

VPKNVAFVIDISGSMAGRKLEQTKE 

ALLRILEDMQEEDYLNFILFSGDVST 

WKEHLVQATPENLQEARTFVKSME 

DKGMTNINDGLLRGISMLNKAREE 

HRJPERSTSIVIMLTDGDANVGESRP 

EKIQENVRNAIGGKFPLYNLGFGNN 

LNYNFLENMALENHGFARRIYEDS 

DADLQLQGFYEEVANPLLTGVEME 

YPENAILDLTQNTYQHFYDGSEIW 

AGRLVDEDMNSFKADVKGHGATN 

DLTFTEEVDMKEMEKALQERDYIF 

GNYIERLWAYLTIEQLLEKRKNAH 

GEEKENLTARALDLSLKYHFVTPLT 

SMVVTKPEDNEDERAIADKPGEAS 

YQPPQNPYYYVDGDPHFIIQIPEKD

DALCFNIDEAPGTVLRLIQDAVTGL 

TVNGQITGDKRGSPDSKTRKTYFGK 

LGIANAQMDFQVEVTTEKmCGTG\ 

RA\STFSWLDTVTVTQDGLSMMINR 

KNMVVSFGDGVTFVVVLHQVWKK 

HPVHRDFLGFYVVDSHRMSAQTHG 

LLGOFFOPFDFKV^DTR PO^nPTKPD 

SIGTKVVCWFVHNNGEGLIDGVHT 
DYIVPNLF 

262 

5759 

A 

268 

1 

1842 


263 

5760 

A 

269 

3 

377 


264 

5761 

A 

270 

1 

621 

MTKRCLDHRGEWLPGAGGGGHTE
GTRCLHHAPVTWVGIEVDIFEPQGI 


SMLDAEASFITNDLLGSALTKSFSG 

KKPVWLRGRHTPKGNLDSEVLAGL 

SPCPIPLAGLTVNGQITGDKRGSPDS 

KTRKTYFGKLGIANAQMDFQVEVT 

TEKITLGTG\RA\STFSWLDTVTVTQ 

DG*APLQGLQGGLQGEGDHSGPQP 

NPGALSEPELV 

265 

5762 

A 

271 

3 

2722 

FSDGLCMVALSHLGSALQLGSLCFP 

RSPFRLLGKRSLPEGVANGIEVYST 

KINSKVTSRFAHNVVTMRAVNRAD 

TAKEVSFDVELPKTAFITNFTLTIDG 

VTYPGNVKEKEVAKKQYEKAVSQ 

GKTAGLVKASGRKLEKFTVSVNVA 

AGSKVTFELTYEELLKRHKGKYEM 

YLKVQPKQLVKHFEIEVDIFEPQGIS 

MLDAEASFITNDLLGSALTKSFSGK 

KGHVSFKPSLDQQRSCPTCTDSLLN 

GDFTITYDVNRESPGNVQIVNGYFV 

HFFAPQGLPWPKNVAFVIDISGSM 

AGRKLEQTKEALLRILEDMKEEDY 

LNFILFSGDVSTWKEHLVQATPENL 

QEARTFVKSMEDKGMTNINDGLLR 

GISMLNKAREEHRIPERSTSIVIMLT 

DGDANVGESRPEKIQENVRNAIGG 

KFPLYNLGFGNNLNYNFLENMALE 

NHGFARRIYEDSDADLQLQGFYEE 

VANPLLTGVEMEYPENAILDLTQNT 

YQHFYDGSEIVVAGRLVDEDMNSF 

KADVKGHGATNDLTFTEEVDMKE 

MEKALQERDYIFGNYIERLWAYLTI 

EQLLEKRKNAHGEEKENLTARALD 

LSLKYHFVTPLTSMVVTKPEDNEDE 

RAIADKPGEDAEATPVSPAMSYLTS 

YQPPQNPYYYVDGDPH/FSIIQIPEK 

DDALCFNIDEAPGTVLRLIQDAVTG

LTVNGQITG\DKRGSPDSKTRKTYF 

GKTGASPMAQMGFPGWEVTTEKIT 

LLEQARCRAFFSWLDTVTVTVQDGH 

FLASSRRLSMMINRKNMVVSFGDG 

VTF VVVLHQA^ CWKKHP VPTVDFL 

GFYVVDSHRMSAQTHGLLGQFFQP 

FDFKVSDIRPGSDPTKPDATLVVKN 

HQLIVTRGSQKDYRKDASIGTKVVC 

WFVHNNGEGLIDGVHTDYIVPNLF 

266 

5763 

A 

272 

1168 

1626 

RAGRGGEGHKLNSYGGRRARSQG
HLLSSALSPFVSAASYQPPQNPYYY 
VDGDPHFIIQIPEKDDALCFNIDEAP 
GTG\LRLIQDAVTGLTVNGQITGDK
RGSPDSKTRKTYFGKLGIANAQMD 
FQVEVTTEKIT\CGTG\RA\STFSWLD 
TVTVT 

267 

5764 

A 

273 

534 

690 

FVIFFSPCSIAMATKENMTSQRGML 
KSIH\SKMNTL\ANRFPA\VNSLIQRV 
NL 

268

5765 

A 

274 

3 

946 

TTKMAAGTSSYWEGEARRPPDLRK 
QARQLENELDLKLVSFSKLCTSYSH 
SSTRDGRRDRYSSDTTPLLNGSSQD 
RMFETMAIEIEQLLARLTGVNDKM 


AEYTNSAGVPSL\NAALMHTLQRH 

RDILQDYTHEFHKTKANFMAIRERE 

NLMGSVRKDIESYKSGSGVNNRRT 

ELFLKEHDHLRNSDRLIEETISIAMP 

TKENMTS\QRGMLKSIHSK\MNTLA\ 

NRFPAVNSLIQRVNLRKRRDSLILGG

VIGICTILLLLYAFHLMGHLQGLLTA 

TAFTPWSGIRKHRREKLTVLIISLTS 

JKMiNAKJL 1 VML/aV 1 Wo(jr 


269

5766

A

275



V MA V LroU I ALK I N WfcFVjKLDLQC 

"KinQQT T I QHADUTVCT 1 t~ZVD T"D AT/TP 

IN uooL L/L.0L1 Ar rll V oULAjr KIKAJv I (j 

RARC\HACNPNTLGGRGGRI 

270 

5767 

A 

276 

2 

424 


271 

5768 

A 

277 

3 

452 


272 

5769 

A 

278 

3 

498 

PTLLVPTDSERTHHGSCFLPDKTNV 

KA\AWGKVGAHAGEYGAEALERM 

FLSFPT\TKTYFPHFDL\SHG\SAQV\K 

GHG\KKVADALTNAVAHV\DDMPN 

\ALSALSDLHAHKL\RV\DPFNFKLPS 

H\CLLVTL\AAHLPAEFHPLRWHALP 

u 1 brLGrLLSrVADLPNTR 

273 

5770 

A 

279 

333 

538 

IFSSLWLFFILSIKDFILFYFLFLAQSR 

SVTNRLECSGTISAHCNLCLPNSSDF 

RVLRLGNRLRLKIKK 

274 

5771 

A 

280 

192 

607 

GRLWGCVSKKSVGCLPHPGCLWA 

AFLTLDACGLPSSPWMPVGSLPHPG 

CLWAAFLTLDACGLPSSPWMPVTW 

FPWGLPKLRDPKPPSNLMTRPVSE\P 

PVLSPSPSPTPSATRPTHFPSLKGPA 

HRPAHVFPFNPCFVP 

275 

5772 

A 

281 

17 

363 

GLESEFLLRGLLRPGEQDSALASAV 
PGSLAQTLPFPWS/PLW/TMSFPAHA 
APHPACCHCLSY/PVSCPVSVPSLLP 
LGCPLQLLPSCPNSCYPSPAVPTYCP 
AGKEEKRRSPSCQACS 

276 

5773 

A 

285 

96 

389 

QGPAEENMAAKMFEFIGKFGLALV 

DAGGVVNSALYSVDAGHRAVVFD 

RFRGVQDIWGKGTYWLIPWLQKS/ 

HFDCRSQPRNVLVFTGSKDLQIGNL

H 

277 

5774 

A 

286 

1 

390 

FFYFFFLERDFLFLFYFIFFAVLLLLP 
NLECNGAISAHKNLRLPG\SSDSPAS 
ASQVAGITGMQHHAWLSFVFLVKT 
GFVHLGHAGLKLPTSDDPPTAASDI 
VGITGMIPPVAGPKQRHFCARSVLV 
PFI 

278 

5775 

A 

287 

16 

546

QLNGRSIRHEVMSHRKFSAPRHGSL 

GFLPRKRSSRHRGKVKSFPKDDPSK 

PVHLTAFLGYKAGMTHIVREVDRP 

GSKVNKKEWEAVTIVETPPMVVV 

GIVGYVETPRGLRTFKTVFAEHISDE 

/CRLLPLRQKKAHLMEIHVNGGTVA 

EKLDWARERLEQQVPVNPVFGQDE 

IViiL/ VI 

279 

5776 

A 

288 

1 

625 

CKFIRVMAHTRLRLLPLRRKKAHL 

MEIQVNEGTVAEKLDWARERLEQQ 

VPVNQVFGQDEMIDVIGVTKGKGY 


KGVTSRWHTKKLPRKTHRGLRKVA 

QKDGKLIKNNASTDYDLSDKSINPL 

GGFVHYGEVTNDFVMLKGCVVGT 

KKRVLTLRKSLLVQTKRRALEKIDL 

KJ" ID n SKFGHGRFQTMEEKKAFM 

UrLKKDKJAKJibGA 

280 

5777 

A 

289 

1 

903 


281 

5778 

A 

290 

38 

482 


282 

5779 

A 

291 

1 

1131 


283 

5780 

A 

292 

1 

1329 

STHASDGVMSHRKFSAPRHGXSLGF 

LPRKRTS\RHRGKVKSFPKDDPSK\P 

VHLTAF\LGYKAG\MTHIVREVDR\P 

GIHRCNKKERWWRAVTHCMRPPP 

MVVGGHLVG\YVET\PRGPPGPFKT 

CLLLEHI\SDELPRGVFYKEFGH*NL 

KKKAFTK\YCKEIGKDED\GKKPAW 

KKDFQQH*KKLLAQVHPCSIAQTQ\ 

MRLLPL\RQK\KAHLMEI\QV\NGGT 

VA\EKL\DWAREKLE\QQ\VPVNPSV 

LGRMRMIDYVIGGDQRAKGYKGGS 

PS\RWHTKKAAPAKTH\RG\LRKVG 

LVLGAWHP\ARVAFSVG/RAAGQK 

GYPSTALEINK\KIYKIGPGVTLSRA 

GSLIKEQCLHLNYDLSDKSINPLGGF 

VHYGEVTNDFVMLKGCVVGTKKR 

VLTLRKSLLVQTKRRALEKIDLKFI 

DTTSKFGHGRFQTMEEKKAFMGPL

KKDRIAKEEGA 

284 

5781 

A 

293 

238 

326 

HTYKSDTRYERHACWGALL/CNYM 
RQECLDSRFVFDRPMPVFRLVSVIG 
TSILYMKAFMHMPFK 

285 

5782 

A 

294 

2 

358 

GWGMSLGGAGVEGMEVGTSDLGF

FSGQRALSPWVSPVPPGLCAWRKD 

SPVEQKPQGPSLPLSALPYLWG/AP 

WPPAGPQTRGLGPFRGTGSPPSIPIS 

RAQKDSWPWPVPSTPACFSAPG 

286 

5783 

C 

295 

56 

175 

MASXNRQQFFXNTPXKLLKSPHCNI 
YRLLSAKSQGKFWK* 

287 

5784 

A 

296 

1178 

1515 

KKFMKILEHMFEGFFFSFLNFFIFSG 
GRRSALTARGGSEVAANLGLTCNL 
HPPGFKRFSCLRLRSSWDYRRPPPR 
PANFWFSVETGFCYVGQAGLKLLT 
SSDPPASAFPKC 

288 

5785 

A 

297 

136 

251 

IHQEKPPNIFSVKKRHYD*PGQHDP
LASASQSAGITGV 

289 

5786 

A 

298 

118 

337 

IHQEKPPNIFSVKKRHYD*PGQYGK 
TLSLLKIQILAGYSGTCL\KSQLLRR 
VGREVIQLALKIRAPIWKIECL 

290 

5787 

A 

299 

160 

437 

KRDITTSLGQYGQNP\SLLKIQILAG 
Y\SGTCL\KSQL\LRRLRHQNRLNLG 
GRGVGSEQRSCHLHSWGGHSETVSK 
KKKKRERQQWRQIGTCMP

291 

5788 

A 

300 

61 

1302 

FSGSCVPPRTCGLCWISTGQSGVVSI
VSSTRLEESEGTQPPSPSSDTGSEGE 
EDDEGEEHGLGGQNEVGIIPTTLEFL 
ENHGKNILLSNGNRTVTRVASYNQ 
GIVVFNQPLVPQLLVQVRIDFLNRQ


WTSSLVLGVITCAPERLNFPASAC\P 

SNGQPGCCGAVGSSTTSQAGLSSQI 

CEKFGPNLDTCPEGTILGLRLDSSG 

GLHLHVNGVDQGVAVPDVPQPCH 

ALVDLYGQCEQVTIVNPEPGAASG

KSAGTQGDMEKADMVDGIKESVC 

WGPPPAASPLKSCEYHALCSRFQEL 

LLLPEDYFMPPPKRSLCYCESCRKL 

RGDEAHRRRGEPPREYALPFGWCR 

FNLRVNPRLEAGTLTKKWHMAYH 

GSNVAAVRRVLDRGELGAGTASILS 

CRPLKGEPGVGFEEPGTNC 

292 

5789 

A 

301 

1

936 


293 

5790 

A 

302 

1 

1023 


294 

5791 

A 

303 

1 

867 


295 

5792 

A 

304 

1 

569 

SGRVAMGRRRAPAGGSLGRALMR 

HQTQRSRSHRHTDSWLHTSELNDG 

YDWGRLNLQSVTEQSSLDDFLATA 

ELAGTEFVAEKLNIKFVPAEARTGL 

LSFEESQRIKKLHEENKQFLVVYRG 

DQTWNQNTTPEELKQAEKDNFLEW 

RRQLWRLEEEQKLILTPFERNLDFW 

RQLWRVIERSDIWQIVDA 

296 

5793 

A 

306 

846 

1070 

RVGDRSERErVILKTNFTYFQVFPKA 
GCGCFSFLFSFFLSFFFLRGETESRSV 
AVRMKCSGVISAHCNLCLPGSS 

297 

5794 

A 

307 

118 

340 

KFQTEVSHFFLCNLICSYFIFFLL/CS 

FLLIHF/LYSLFFFLLFCFMFFLFIMIY 

/LFFVLLIRYSYIKSLLFLMSCN 

298 

5795 

A 

308 

42 

352 

TRGPRVPHSGSASSPAQKSGCTG/P* 
NSALARPALVSFRAMPNSRGW/PQG
EQR/PGSPHHRSPEGHWKRVHVPPA 
AQRGPGAGGCHQGTGPEAQGAHQ 
VRPPAQGG 

299 

5796 

B 

309 

796 

3180 

VAEAPGLVDVPGGHPEPQSCEKLE 

NTGGKIGHRKKMPYSTPAPCVSPLK 

LDLWLSVRERTPDSGSLTLLHCATS 

DPQGQQALCPGGSPQHQDLAGQLV 

VHELFSSVLQEICDEVNLPLLTLSQP 

LLLGIARNETSAGRASAEFYVQCSL 

TSEQVRKHYLSGGPEAHESTGIFFV 

ETQNVRRLPETEMWAELCPSAKGA 

IILYNRVDVVLASTPMRICPPAAMPP 

LLPLRLCRLWPRNPPSRLLGAAAGQ

RSRPSTYYELLGVHPGASTEEVKRA 

FFSKSKELHPDRDPGNPSLHSRFVEL 

SEAYRVLSREQSRRSYDDQLRSGSP 

PKSPRTTVHDKSAHQTHSSSWTPPN 

AQYWSQFHSVRPQGPQLRQQQHK 

QNKQVLGYCLLLMLAGMGLHYIAF 

RKVKQMHLNFMDEKDRIITAFYNE 

ARARARSVPALFCSLLPVQEPHFGIP 

IPTTQAPVSQPDAPGHQRKVVSWID 

VYTRATCQPREVVVPLTVELMGTV 

AKQLVPSCVTVQRCGGCCPDDGLE 

CVPTGQHQVRMQVLGTWGNGQG 

MQILMIRYPSSQLGEMSLEEHSQCE 

CRPKKKDSAVKPDSPRPLCPRCTQH 


HQRPDPRTCRCRCRRRSFLRCQGRG 

LELNPDTCRFSCLSTAGSLLQLTDV 

WWLLGRLKISLVGEQAQPDHSSHE 

SQPRCTGRVLSICLSAVATATGAEG 

KRKLQIGVKKRVDHCPIKSRKGDV 

LHMHYTGKLEDGTEFDSSLPQNQPF 

VFSLGTGQVIKGWDQGLLGMCEGE 

KRKLVIPSELGGATLVFEVELLKIER 

RTEL* 

300 

5797 

A 

310 

61 

674 

GCGTLGPLQWDFPEPGCKGMMAPL 

AEGQSSAHISVWGNLRTFCVSTKKI 

PVDSGASGSPTQVSASLTCSESQAA 

LDIELGTGLGNNLVSFRGDAKQAG 

AGLRVNKRAGSPPSTRSPEGHWKR 

VHVPPAAQRGPGGWGLPPRAHGPE 

AQGAHQVRPPA\QGPQPPAGSGAG 

RQGSHRLWLVQRPPPVGPPDRPAC 

HPSRWHPAVAA 

301 

5798 

A 

311 

89 

1166 


302

5799 

A 

312 

1 

2094 

MGAPAVQSSSGPAGARPRKAGVER 

RAEPAGPGLPETTRKSPQPILGFSLR 

AVVWDLFPGSKQIVRRKLPIPGQAV 

LVQADVATLTSRRVLHACGLVPLE 

MPCIQAQYGTPAPSPGPRDHLASDP 

LTPEFIKPTMDLASPEAAPAAPTALP 

SFSTFMDGYTGEFDTFLYQLPGTVQ 

PCSSASSSASSTSSSSATSPASASFKF 

EDFQVYGCYPGPLSGPVDEALSSSG 

SDYYGSPCSAPSPSTPSFQPPQLSPW 

DGSFGHFSPSQTYEGLRAWTEQLPK 

ASGPPQPPAFFSFSPPTGLSVPSLAQS 

PLKLFPSQATHQLGEGESYSMPTAF 

PGLAPTSPHLEGSGILDTPVTSTKAR 

SGAPGG\SEGRCAVCGENASCQHY 

GVRTCEGCKGFFKRTVQKNAKYIC 

LANKDCPVDKRRRNRCQFCRFQKC 

LAVGMVKEVVRTDSLKGRRGRLPS 

KPKQPPDASPANLLTSLVRAHLDSG 

PSTAKLDYSKFQELVLPHFGKEDAG 

DVQQFYDLLSGSLEVIRKWAEKUP 

GFAELSPADQDLLLESAFLELFILRL 

AYRSKPGEGKLIFCSGLVLHRLQCA 

RGFGDWIDSILAFSRSLHSLLVDVP 

AFACLSALVLITDRHGLQEPRRVEE 

LQNRIASCLKEHVAAVAGEPQPASC 

LSRLLGKLPELRTLCTQGLQRIFYLK 

LEDLVPPPPIIDKIFMDTLPF 


303

5800

A

313

cOo 

1 1 A O 

1 143 

QLVPCCPPTQRTVQKNAKYICLAN 
KDCPVDKRRRNRCQFCRFQKCLAV 
GMVKEGVWL/RVRPTGARVGLSGV 
RPPGPPGFCPGGPTGGHVLFPPHL 

304 

5801

A

314

1 Gfk 

lyy) 


biUKJCQDLSICCLQVTHFTFKDSQRL 
KVKGWKKMFHTNKNQKRIWT 

305 

5802 

A 

315 

190 

324 

ERIKKQDLSICCLQVTHFTFKDSQRL 

306 

5803 

A 

316 

85 

310 

CAWHVNILIGKRLNTFPYRSGTRQG

CMLLPFLFNTILKDLVTALKNQDIK 

GKQIK/EEIKLSLFTEMITRVDKNQS


307 

5804 

A 

317 

33 

494 


308 

5805 

A 

318 

1 

612 


309 

5806 

A 

319 

113 

551 

LLWRESAVTALWGKVNVDEWGGK 

ALGRLLVVYPWTQRFNFESFGDLST 

PDAVMGNPKVKAHSVKKVLRGAFS 

GG\LAHL\DNLKGTFAHTEVSLHC\D 

KL\HVGSWRTFRLLG\NVLVCCCWA 

HSLLGKEFQPHQLQACLIKKIGWLG 

VG 

310 

5807 

A 

320 

221 

376 

DRVSIPRLESSGAILAHCNFRL/SGFK
QFSCLSLPSSCDYRCVLPRRALCSSC 

311 

5808 

A 

321 

32 

452 


312 

5809 

A 

322 

72 

570 

SRRAWVSFTEEDKATITSLWGKVN 

VEDAGGETLGRLLWYPWTQRFFD 

SFGVNLTCASAIMGHPKVKWHGKK

VLTSLGDAI\EHLDDLKGTFAQLSEL 

HCDKLHVDPENLKLLGNVLETALAI 

/HFSAKQFTPEVQASWQKMGD\GV 

ASALCFTKHLDFMCMMQSFQR 

313 

5810 

A 

323 

35 

359 


314 

5811 

B 

324 

102 

431 

MIIYRDLISHDEMFSDIYKIREIADGL

CLEVEGKMVSRTEGNIDDSLIGGNA 

SAEGPEGEGTESTVITGVDIVMNHH 

LQETSFTKEAYKKYIKDYMKSIKGK 

LEEQRPDR* 

315 

5812 

A 

325 

132 

708 

RRRRLPSVAIMIIYRDLISHDEMFSDI 

YKIREIADGL\CLEV\EGKMVSRTEG 

NIDDSLIGG\NASAEGPEG\EGTRST\ 

VITGV\DIVMNHHLAGNKFSQKEAY

KKYIK\DYIEIQFKGETLKEPEDQKR 

VKPFYDRGLQEQFKHILG*FSKTYQ 

FFIG\ENMNPDGMVALLDYREGWV 

*PHI*FSFKDG\LEM\EKC 

316 

5813 

A 

326 

1 

5796 


317 

5814 

A 

327 

3 

467 


318

5815 

A 

328 

73 

1593 


319 

5816 

A 

329 

57 

1358 

RRKVAMDLIPNLAVETWLLLAVSL

VLLYLYGTRTHGLFKRLGIPGPTPLP 

LLGNVLSYRQGLWKFDTECYKKYG 

KMWGTSSLFGPHYPSSYEALGGSC 

VRLLLCVTP**TRT*GCCVSYN*GT 

YEGQLPVLAITDPDVIRTVLVKECY 

SVFTNRRICATTSTIKMQTHSVTMW

LPPAVLQSQHGVCLFL*QSLGPVGF 

MKSAISLAEDEEWKRIRSLLSPTFTS 

GKLKEKRHHKIHYKMSLTAPCWRK 

PYPSGT*VCTFNYSIFpAYSMDVITG 

TSFGVNIDSLNNPQDPFVESTKKFL 

KFGFLDPLFLSIILFPFLTPVFEALNV

SLFPKDTINFLSKSVNRMRKSRLND

KQKHRLDFLQLMIDSQNSKETESHK

ALSDLELAAQSIIFIFAGYETTSSVLS 

FTLYELATHPDVQQKLQKEIDAVLP 

NKVRG 

320 

5817 

A 

330 

870 

1150 

HRLDFLQLMIDSQNSKETESHKALS 
DLELAAQSIIFIFAGYETTSSVLSFTL
YGTGPLHPDVQAGNCKREIDAVLP 


NKVAPPTYGAVGTDGSYL 

321 

5818

A

331

144

Oil 

RRLCMjlb rSCHCIlTNElr IFIFFEAE 
SHSVA\RLECSGAVLAHCKLCLPGL 
RHCPASATREAEAREWLETRSRRL 
Q 

322 

5819 

A 

332 

3 

323 

DRVSLSLSPRLECNGMISTHCNLHF 

PGaSDoPDTP/SQVAEITGVHHHAQL 

IFVFLVETRFHHIGQAGLELLTSSDL

PTSASPSAGIIGVRHCAWARITFQRT 

KCFSI 

323 

5820 

A 

333 

187

450 

NYVSQK^KKLNSPINYXKEIEFIVLK 
LPKUCKPLGPNGFTAEFYQTFKKGM 
\TPILDHLLQKIDVTLPYLFYKTDFT 
LTLKPKTIQKTRA

324 

5821 

C 

334 

122 

292 

MMCSMTLSFIFSFMRKLCRSIRASS
WNSPWFRVSGCPSFTEYWWKVLM 
MVYMLRSS* 

325 

5822 

A 

335 

295 

931 

VLSRKCQRSLTAFSSKCPNSWFSITQ

TECKTMTCGMPQHVTQQ*RPIINTS 

HQYSVKLGHPRHPETRGRFKELVR\ 

KDLQNFLKKENKNEKVIEHIM\EDL 

DTNADKQLSFR/EEFIMLMGEA*PG 

AFPRRKIARGLTEGPGnHHHKPGPG 

GGAPPKDHSGPRFTVGHGHGHSTW 

WPRPQATNHGGQATLPLPNHRPRG 

LLCQTVLAVGLGAGAK 

326 

5823 

A 

336 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLWYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

327 

5824 

A 

337 

3 

556 

HSLFGTSEVINKLRSPDA\MGHFTEE 

DKATITSLWGKVNVE\DAGGETLGR 

LLWYPWTQ\RFFDSFGNLSSASAIH 

GQPPKSRHMGKKVLTSLGDAIKHLN 

DDLKGHLLPKPEVNCTCDKAALLD 

PEELSSFLGEMLLG/VPVFGQSHFRA 

KEFHPWRLQGFPGISRRWQKMVTV 

GV\ASALVPSRYH 

328 

5825 

A 

338 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLWYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

329 

5826 

A 

339 

38 

547

APSPDAMGHFTEEDKATITSLWGK

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSAVSAIMGNPKVKAHGK 

KVLTSLGRCHKSTWDDLKGYTFAQ 

A*SE\LH\CDK\LHV\DPGGTFKLLGK 

MLLG*PV\LAIPFSAKEFHP*RLQAS 

WQKQKMAEDGDLELASALVPSRY 

TT 

ri 

330 

5827 

A 

340 

168 

330 

SSLGLDLVCGDMAKCTKKVRIISKY 

GTRYGASLRKMVKXRIAITQHTKYI 

CSSRA 


331 

5828 

A 

341 

2 

355 

ARATMVLSPADMTNVKAAWGKVG 
AHAGEYGAEALERMFLGLLTTKTY
FSHFDLSHGSAQVKGHCMKVVDAL 
TvNAGINVDNL\PNAL\DTLIDLLTPIF 
CRSLLNFYLISNSLFIIISVH 

332

5829

A

342

110

410

AGLLPDP/TITARMNVGVAHSEVNP 
NTRVMNSRGIWLAYIILVGLLHMV
LLSIPFFSIPGGWTLTNVIHNLATYV 
FLHT 

333 

5830 

A 

343 

469 

708 


334 

5831 

A 

344 

49 

351 

ATSPD\AMGHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLDGYPWTQR 

GFDSFGNLNYTSDVMVDPKFMGHG 

MKVLTYLGDALCDLDDTNGNFAH 

VSTVMC 


335

5832

A

345

665 

921 

AKKKEKKTGALSARRQPNPPTQNT 
PHPHPPNPTPHHPPPPSPPTPNPHSPPP 
FLILQKLLLIAVTIFDPTYCVISYSW 
VIMTFNKL 

336 

5833 

A 

346 

2 

341 

HEEGFVNPGARFCLPEAAAVRRPPG 
EATVIMSDQEAKPSTEDLGDKNEGE 
SIKLPA^LAHDRTETHFNVKTTTHLT 
SLPQSYCQIQAVPLNSLTLLFARPTT 
AAHHTPPELPMQ 

337 

5834 

A 

347 

209 

397

VSLWQEAMRLPKNTPEEKDRRTAA 
LQEGLRRPVSVPLTLAENGARLWS 
DMENLSDIYWYASE 

338

5835

A

348

87 

356 

IHFYRVKIFFHILCFYIFIQICHYSFIF 
YFFCRQG/HLSPRLEGSGAILAHCNL 
CLLGSNDPPTSASRVAGTAGTHHH 
AWLIFVFFIETGY 

339 

5836 

A 

349 

3 

204 

KMEARKQRESMRGREAEREKEKG 

YERSSEGERVV\ERNIGHKRRRDAK 

REARWEKIHGAKEARRNRYK 

340 

5837 

A 

350 

3 

341 

HERHEIPIIKMSHRGPWLMVDFLSY 
KLSQNGYSWSQFTDVEENT\TEAPE 
RTELDVRTTPIAINGNRSWHLADSPA 
VNGTTGHSSSSDARDVIPMAAVQH 
ALWEASDEFELRHR 

341 

5838 

A 

351 

67 

541 

EAPARRALCGRVPSEAQRDGHQAP 

LLSRRRRL*AFFVADGIFKAELNEFL 

TRELAEDGYSGVEVRVTPTRTEIIIL 

ATRTQNVLGEKGRRIRELTAWQK 

RFGFPEGSVELYAEKVATRGLCAIA 

QAESLRYKLLGGLAVRRCAGNQSE 

DHACLGTNW 

342 

5839 

A 

352 

3 

495 


343 

5840 

A 

353 

1 

459 

EDGYSGVEVRVTPTRTEIIILATRTQ 

NVLGEKGRRIRELTAVVQKRFGFPE 

GSVELYAEKVATRGLCAIAQAESLR

YKLLGGLAVRRACYGVLRVIMESA 

AKGCEGWSGKLRGQRANS/MKFG 

KAGGFPGKLVNYYCALVGPLCAYT 

vj v v vjJri 

344 

5841 

A 

354 

1 

885 

SWSTHASVSAERGGKMAVXQISKK 
GEFV\ADGIFKAELNEF\LTPQLAED 
G\YSGVEVRVTPT\RTEIIILATR\TQN



V\LGEKGR\RIRELT\AV\VQKRFGFP 

RRASVELYA\EKVGHYRSCVAIAQG 

RSLCVYKLLRKGFACAGGPCWGV 

AAGSIMEKWGPKAFEVCWWSGKT 

SEEQRA*IP*SFVEWP*WIHSGDPV* 

LTNVDTAVR\HVLLRQG\VLG\IKVK 

IMLALGTQLGKIGPKKPLPDHV\SIV\ 

EPKDEIL\PTTPIS\EQKGGKPVTALH 

GPTRSPQPNRVSLAAVFWSLDVAL 

345 

5842 

A 

355 

1 

284 

SLFLYTANSRLGPLVSPAFMPHRISC 

NVTKGLPHDHYACLQEIKSSYKFYR 

YFETQQQSVPQCLSRTHQKSRALN 

NVYSAVRRLQVHMKALLNE*VSPA 

FMPHRISCNVTKGLPHDHYACLQEI 

KSSYKFYRYFETQQQSVPQCLSRTH 

QKSRALNNVYSAVRRLQVHMKAL 

LNE 

346 

5843 

A 

356 

1 

1404 


347 

5844 

A 

357 

1 

771 


348 

5845 

A 

358 

3 

913 


349 

5846 

C 

359 

461 

667 

MRMTMMMMMIHLKLILILMMMM 
KSMEPLLEGAYDPADYEHLPASAEI 
KELFQYISRYTPQLIDLGTTN* 

350 

5847 

A 

360 

76 

158 


351 

5848 

A 

361 

1 

2313 


352 

5849 

A 

362 

788 

926 

PSPELPEGDFEGFFPQKLQ* SCLPTL 
QKKKNNNNNN^^NNNNNNNEK 

353 

5850 

A 

363 

168 

447 

TGTPGYACNSQNLGGPTGGISRSPV 
*NQPGQKGETPGFLKIPKLTRGGGR 
ALQFQVLGRVRPENPLNLCGQNFN* 
PKLCPCTSTWGKIRLPF 

354 

5851 

A 

364 

637 

1258 

VLFLRKPTPAACLGHALSHRNLGPS 

AANSPSVLGKPAPSWSHVPATVLPG 

GQQGTPCDMRVSGTVRVGSTVMST 

TSIPALPHLGSTSVGPPQPGGHEKQ 

MITWCKDRLQLTHSDEGFGVGFFQ 

TTMYILASKMCTGAQRSGCWALRV 

PQEDGKNQLIRFYCMYVCIYFETES 

HSVVQAGVQWRDLDSL*PPSPEFKR 

ISCLSFLSSW 

355 

5852 

A 

365 

217 

481 

KCSFQM*IYRLKNYNNNHSHPFSISL 
FLISSNIQNNFGSRYN*NHLKMYKT 
EAQRLTCSMLHKSNPHLFILNRMFL 
TRNLLGPHSLVP 

356 

5853 

A 

366 

1 

245 

PVPRGGSKLLTHHLAPLTLPKAGDS 
GVNPRVPPFFLSPPAIWGPKPKILGL 
AKTPVPRFPLGKKFFPSP*FPPFFPK 
NKTL 

357 

5854 

A 

367 

145 

196 


358 

5855 

A 

368 

120 

173 


359 

5856 

A 

369 

138 

321 

NECLLGSFFSV/PNSSLLK*KS*ASA 

VAHTCNPSTLGG*GGWIT*GQEFET

SLANMVKPCLY 

360 

5857 

A 

370 

1536 

1629 

KSQKACNPSTLGG*GGWIT*AQEFT 
TSLANT 

361 

5858 

A 

371 

11498 

11651 

LKNNFKKCTMWA\GMVADTCNPST 
LGGRGGWIT*GQGFKTSLANMMKP 



CLY 

362 

5859 

A 

372 

15 

272 

RLJCAATLKMPGSAAKGSELSERIES 
FVETLKRCGVPRSSEDTARVTLLM 
MRWIFNDHRWIPP*ELVDPYIYFPW 
PCSTLSCWDWS 

363 

5860 

A 

373 

433 

612 

QAPLQKPTVRR*K*VREIRGRD*VE 

E*IEEWYR*RSGRETRRGRESGR*ER 

GEVDREERKRE*GSRVRRGRERRG 

RERRGRERGGEESQEGKREKRKRE 

RRGREKRGREERKRAKEVFKDGER 

PRAKVGVIVLFCRFQ 

364 

5861 

A 

374 

785 

1178 

ALGCPGCPLLAVSGKDHNSSTQPAT 

HNSRDRRERRKEERERRGRERGEEE 

IEEGKR*RSGRETRRGRESGR*ERGE 

VDREERKRE*GSRVRRGRERRGRE 

RRGRERGGEERQEGKREKRKRERR 

GREKRGS 

365 

5862 

A 

375 

1969 

2208 

GANPIHDLHPHDLTTSHRPHIFIIFFE
MESRSVTQAGVQWHDLGSLKSPPT 
GLKLFSCLSLPSG *NYRCTPSHLANF 
CIF 

366 

5863 

A 

377 

171 

442 

GKKWSFSLQNWHVQAY*LSCNRY 
CSLKDHDFITPSDGGPDIFLHICDVE
GEYVPVEGDEVTYKMCSIPPKNEKL 
QAVEVGITHLGPGTQH 

367 

5864 

A 

378 

3 

775 

SVHSSAHASERVAEQNGLQGQAMS 

SVPSPPPQPPTHQA\GVGLLDTPRSR 

ERSPSPLRGNVVPSPLPTRRVTRTFSA 

TVRXASQGPVYKGVCKCFCRSKGH 

GFITPQLMAAPDIFLHISDVEGEYVP

\VEGDEVT\YKM\CSIPPKN\EKLQ\A 

VEVVIT\HLAP\GTKHETWSGHVISF 

LGDGGSTPCPVLVGRLCGEEAADT 

GDDILPHETGLQRGNGPSHVSPGGK

GYGGAGVGCGVFPAISTAYGPLQQ 

PLHHLKSIKSI 

368 

5865 

A 

379 

7 

316 

APSPDAMGHFTEEDKATITSLRGKE 

NVEDAGG*TLGRLLDDYPWTHRIL 

DS*GKLLSDYAIMGKQDDKEHAEK 

ELPSLEDALAHWADASASGHWPSD 

VPCAYR 

369 

5866 

A 

380 

61 

304 

ARTWNSVRMASSGMTRRDPLANK 
VALVTASTDGIGFAIARRLAQDGAH 
WVSSRKSQNVDQV*VST*LASV*L 
IYLMCVLP 

370 

5867 

A 

381 

2 

281 


371 

5868 

A 

382 

2 

558 

HSLLERLRLSISFLVQTPIGHSTEEDV 

KATIXTSLWGKGEMWKNAGRKKPL 

GRPPGLSLPQWTPRGSFEQALG\NL 

VSSCPPAPSMGKPPQKSKGTMAKK 

GA*PSLGKMPIKAPLDD\LKGT\FAP 

A* SELH\CDKLrWLDPENFKLLG\N 

VLVTWLAIHFGKEFTPEVQASWQK 

MVTAVASALSSRYH 

372 

5869 

A 

383 

3 

368 

EFFCGLCVKSEISLHLFCLANFFPSL 
KPQITSSGEMVPLLPCQS*EWRRKD 
ESSTLPPPPSSGAECCPTWLRPSPSTS 



PCPCLPHPYVQGSLCETQSHLTVNP 
ASSSYRISPPPLISSRTRY 

373 

5870 

A 

384 

179 

455 

EFGWGGGKSLGLPRAGLD*IGGSLG 
FIPLLSTPVSHSHAFSVGAITALIFLL 
ESLAFQWLLLLSSSHFLYFSLLFFRQ 
SSFCFLTEEQKKKK 

374 

5871 

C 

385 

22 

423 

MKAAVLTLAXLFLTGSQARHFWQ

QDEPPQSPWDRVKLHELQEKLSPLG 

XEMRDXRAPMWTXXNASGPLQRR 

VRXLWPRALRLSRRTAAQTWPSTT

XRPPSILSTFSEKGQARVRGTSAKA 

CXPLLESXKGXVS* 

375 

5872 

A 

386 

1 

671 

SGRIQEVPHGPFRMKAAVLTLAWL 

FLTGSQARHFWQQDEPPQSPWDRV 

KDLATVYVDGLTEDSGKDSVTSTFS 

KLRE*LGPVTQ\EFWDNL\EKETEGL 

RQE\MSKDL\EEVKAKVQP\YL\DDF 

QKKWQEEMELYRQKVEPLRAELQE 

GARQKLHELQEKLSPLGEEMRDRA 

RAHVDALRTHLAPYSDELRQRLAA 

RLEALKENGGARLAEYHAKATEHL 

STLSEK 

376 

5873 

A 

388 

24 

499

HTDTYPHPHLIARPQGFPELKNDTF 

LRAAWGEETDYTPVWCMRQAGRY 

LPEFRETRAAQDFFSTCRSPEACCEL 

TLQVRGPQKRERFMPSVCHLATCL 

LFFTAPLRRFPLDAAIIFSDILVVPQA 

LGMEVTMVPGKGPSFPESLREEQDL 

KRLLDPEMV 

377 

5874 

A 

389 

109 

750 

HTDTYPHPHLIARPQGFPELKNDTF 

LRAAWGEETDYTPVWCMRQAGRY 

LPEFRETRAAQDFFSTCRSPEACCEL 

TLQPLRRFPLDAAIIFSDILVVPQAL 

GMEVTMVPGKGPSFPEPLREEQDLE 

RLRDPEWASELGYVFQAITLTRQRN 

LAGRVPLIG\FAGAPW\TLMTYMGFI 

LTWTQNMWAPLWMLCINTHVCFD 

RTECIPLPSSTTNTDD 

378 

5875 

A 

390 

1 

295 

PQTQREPAMVLSPADKTNVKAAW 
GKVGAHAGEYGAEALERMILFFTT 
TRTYFPRLDLSLLSDPV*FPVITEAF 
ARTYSGVIADLLSNTEPHMIQMAAS 

379 

5876 

A 

391 

112 

310 


380 

5877 

A 

392 

49 

615 

RAQRGCSQSCGKMNARGLGSELKD 

\SFPVTELSASGPLES\HDLLRKGF\S 

CVKNELLPSHP\LELS\EKNFQLQPR 

LK*NFSTLEETFQGSILLPLKITGGDF 

QGQCRQV\QRLPFSFQAPNLSTGMV 

FEGGNDETIWDLEDIL**SHHKSEV 

HGESHTFDGWEYKPWVYCNSSAGS 

WKPRAAILFIVIFVL 

381 

5878 

A 

393 

167 

1955 

LCPHVVEGMWEVPVISLMRALIPF

MRASPSRVRRAATPAAVTCQLSNW 

SEWTDCFPCQDKK/YTVMTLSAIQT 

IQGNILISETLIMSAMAGFPNKYRHR 

SLLQPNKFGGTICSGDIWDQASCSSS 

TTCVRQAQCGQDFQCKETGRCLKR 



HLVCNGDQDCLDGSDEDDCEDVR 

AIDEDCSQYEPIPGSQKAALGYNILT 

QEDAQSVYDASYYGGQCETVYNG 

EWRELRYDSTCERLYYGDDEKYFR 

KPYNFLKYHFEALADTGISSEFYDN 

ANDLLSKVKKDKSDSFGVTIGIGPA 

GSPLLKFIFTRIFTKVQTAHFKMRK 

DDIMLDEGMLQSLMELPDQYNYG 

MYAKFINDYGTHYITSGSMGGIYEY 

ILVIDKAKMESLGITSRDITTCFGGS 

LGIQYEDKINVGGGLSGDHCKKFGE 

RARKAMAVEDIISRVRGGSSGWSG 

GLAQNRSTITYRSWGRSLKYNPWI 

DFEMQPIHEVLRHTSLGPLEAKRQN 

LRRALDQYLMEFNACRCGPCFNNG 

\7"DTT CPTCPDPr\PDl OCT /"* A AOT7AT 

VrlJLbO 1 oCKCCjCKLvjoLOAACEQT 
V£ 1 Jb/U^UAJvALHjro Wot Woi>Wo VCK 

AGIQERRRECDNPAPQNGGASCPGR 
KVQTQAC 

382 

5879 

A 

394 

94 

276 


383 

5880 

A 

395 

25 

1876 

ILQGPACTHLLLQFPEYIALFLQGNI

VRGLLAEMFAVVFFILSLMT*QPGV 

TAQEKGNQRVRRPATPAAVTCQLS 

NWSEWTDCFPCHDKKYRHRNLLQP 

NKFGGTICSGDIWDQASCSSSTTCV 

RQAQCGQDFQCKETGRCLKRHLVC 

NGDQDCLDGSDEDDCEDVRAIDED 

CSQYEPIPGSQKAALGYNILTQEDA 

QSVYDASYYGGQCETVYNGEWRE 

LRYDSTCERLYYGDDEKYFRKPYN 

FLKYHFEALADTGISSEFYDNANDL 

LSKVKKDKSDSFGVTIGIGPAGSPLL 

VGVGVSHSQDTSFLNELNKYNEKK 

FIFTRIFTKVQTAHFKMRKDDIMLD 

EGMLQSLMELPDQYNYGMYAKFIN 

DYGTHYITSGSMGGIYEYILVIDKA 

KMESLGITSRDITTCFGGSLGIQYED 

KINVGGGLSGDHCKKFGGGKTERA 

RKAMAVEDIISRVRGGSSGWSGGL 

AQNRSTITYRSWGRSLKYNPVVIDF 

EMQPIHEVLRHTSLGPLEAKRQNLR 

RALDQYLMEFNACRCGPCFNNGVP 

EGAKADGSWSCWSSWSVCRAGIQE 
RRRECDNPAPQNGGASCPGRKVQT 
QAC 

JO't 

3551 

A 


z 

ox)/ 

Au V " W L)LLibL(^rLPrKLK.Qr 5/CI 
LNPGNLSKEF*STKETKQNIFVGHIQ 
SQTSKFAISLIQIHPINMRSGTKTFM 
MV*GNKQRSKFPIWTFKIFPDMLPS 

j5j 

JOOZ 

A 

A 

3y / 

1*7/1 

MLS* 
ODD 

PGSSNSPGSAS*VAGTIGMHHHARL 
MFVFLVESGFHHVGQAGLELLTSSD 

rrAoAovioAOlKuloKKACjLUr 


JOOJ 

A 




QILLMTLTGKTISL*LDPSDTIVNVK 
ALIHDIERIPPDHEMLIFACKQLE 

387 

5884 

A 

399 

202 

418 

RLGGVEEGWGKGRLRLNLRLRGGL 



Jrll i Ml 1 ILUJLNloLbDMrNr I I*N VK. 
AMILSNNGIHSHE*RLIFEGMR 

388 

5885 

A 

400 

144 

433 


389 

5886 

A 

401 

1 

3135 


390 

5887 

A 

402 

79 

929 

PVAQGMLRWTVHLEGGPRRVNHA 

AVAVGHRVYSFGGYCSGEDYETLR 

QIDVHIFNAVSLRWTKLPPVAPGEV 

CHPWASS\WPYMRYGHSSV\PSDD 

TVLLWGGRNDTEXGPCNVLYAFDV 

NtHKWFTPRVSGTVPSARDGHSAC 

VLRKIMYILGGYEQQADWFSNDIH 

KL 

391 

5888 

A 

403 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

392 

5889 

A 

404 

50 

562 

APSPDAMGNHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSASAIMGNPKVKAHGK 

KVLTSLGDAI\EHLDDLKGTFAQLSE 

LHCDKLHVDPENLKLLGNVLETAL 

AI\HFGAKILPFKGRLPGRRWQKMV 

TGVASALCFTKHLDFMCMMQSFQR 

393 

5890 

A 

405 

228 

420 

TPEADALYSHNPGGNLDRHTASKPS 

ALLQPGPAWQRGSACSLQILPESRV 

GFPTGPP*ARKVSI 


394

5891

A 

406 

653 

940 

KWKKINVFFETGSRSAAQARVQWC
HLGSLQP*HPRLKEPPASASQTAGT 
TGMHHHAWLS*VSFVKMRLGHHQ 
DIRRLMDSINMPHYMHQAPPMCQ 

395 

5892 

A 

407 

795 

1802 

CRLHTQQIQRLETASGFLRMKGKNS

VQLQEGWERFQDPGNHITRPRPFLP 

SDPHPTLMCLQGPPTGKGPGKSRAT 

GTKAAEGA\DETSYF*NAFQLPLYK 

LIKIIRKKEK*K*KSCT*KRVRWSKL

CPRDWAAARTEAPPTGLESRQPVC 

Q\DPPPLPTAACIPP/CWLGSF*KRM 

ND*QTKITPWG*FPHHPRL/PPSSSPS 

NSSSSPSSPSSKLSSSSMASPVKYST 

ARGTIRSRKKCPISKSEANVNSESSS 

SDSPSPDATDLPFNGLKKLKKDSLG 

1 Or VI VL, 1 VPRPLCFCFFLMVLTVTF 

FPFFQSIVHPSQSTISGPSKEKGSALS 

GSDFIL 

396 

5893 

A 

408 

342 

515 


397 

5894 

A 

409 

3 

333 

AAWLLLGAATGLTRGPA/PRPSPPR 

ALTPA*GPLAAFTAARSDAGIRAMC 

SEIILRQEVLKDGFHRDLLIKVKFGE

SIEDLHTCRLLIKQDIPAGLYVDPYE 

LASLRERNITEEKTSWRRLWLPSDN 

398 

5895 

A 

410 

877 

1206 

QGGQSSLGTAGPEPDSPGDPGSAAE 

PAPGPHIPATRREAREPGPLPRSGPP 

SPAPLTGVRARGGEGRGGPAREPG 

RRPEEQPGGR 



399 

5896 

A 

411 

238 

326 

LHSGPGVVVT*YRKMTSLWAGCSR 
HACNPSTLGSRGIQITRGQEF 

A f\A 

400 

COAT 

5897 


A TO 


AHA 

tiA\\rv\AUU\Ar i X7T> uncT\/i aivt x>r\r\ 
lYlWlsJVlrtrlMUilKrlvJoi VJUAlYJjKyy 

MDfWFFQUQQfWTWTI A"MPMVI riCT T> 

lvirv^tir r orlooV^ i irilJLAJNJDlN i JLvjoI*" 
FT T I If Wl^FFTl£Pr , TPAC 1 C>jAUAnFPP 

ARRKETAAPQPCRRPAAR* 

401 

5898

A 

413 

1 

88 


402 

5899 

A 

414 

65 

191 


403 

5900 

A

415

131 

363 

EVKMAGFLDNFNRWPECECIDWSER
RNWASVVAGILVSEKDWLVTCIPY 
LLPWKLMPVPLN*EWLSRTIYFTAV 
LYR 

404 

5901 

A 

416 

146 

567 

EVKMAGFLDNFNRWPECECIDWSER 
RNAVASWAGILFFTGWWIMIDAA

\f\ Avnv nrA\T xttj a t?tj*t , /"»/"'\ 7t?c*t*t at? 

VV\YPKPEQ\LNHAFH rCGVFSTLAF 
FMINAVSKLLQV\RGDSYGKAAVL\ 
GRTGA\RVWAFHWGFMLMFGSL\IA 
SMWI\LFGAYVTPKYLMFIRD 

405 

5902 

A 

417 

17 

369 

KLTGFLGLGVPPKPVIPFKNRPIGPG 
PWVPPVIPAPLEAQVGGSPSPEIGAP 
PGYKGEPPFFLKPQKFTRQCGQPPL 

RPCPSTWATKPNFVS 

406 

5903 

A 

418 

553 

673 

RRIEKGQVQWLTPVISVLWEAAAG 
D*LEASSSRLYATPPD 

407 

5904 

A 

419 

2 

427 

HVIKVLHDDWIFTPFIQGP*SM/CSS 
KNESRHIGS*RVTG*LLEVLKSLL*S 
FGRLNALNMKSIjTSEVQEE*RKLN 
KTHRVQRDFDKDRKLAVGQSESPG 
HPTSEKPPSTSSSAGCMLCSLHISRG 
FQLRRKRQLNGKCCPIQ 

408 

5905 

A 

420 

82 

371 

RRHSVACTPHPSSQVLKSLL*SFGRL 
NALNMKSLKAKFRKSDVN*IKLIEC 
KEPSTEN*LLARVKVLVIRLPRNLL 
QPHRLLAVCYAAYISPLAFS 

409 

5906 

A 

421 

103 

430 

SFGRLNALNMKSLKAKFRKSDTNE 
WNKNDDRLLQAVENGDAEKVASL 
LGKKGASATKHDSEGKTAFHLAAA 
KGHVECLRVMITHGVDVTAQDTTG

\HSALHLAAKNSHHE 

A 1 A 

410 

5907 

A 

A 

422 

Of 

2o3 

Crr 1 D I XT A I XTKytVCT V A VCX/f T/ - C PinTXTC 

oruKJLN ALNMKbLisAl^MKoU 1 Nb 
WNKNDDRLLQAV*NGDAEKVASL 
LGKKGASATKHDSEGKTA 

411 

5908 

A 

423

2 

424 


412

5909

B

424

108

395

VG AHAGEYG AEALEKMr Lbb FTTR 

1 irrHrULbnufCruLKunuN^ 

p a Tv^pprji^prrrrr'PTCT cat qht ha 

HKLSGGTRFNFQAPKATGLLG* 

413 

5910 

A 

425 

2 

334 


414 

5911 

A 

426 

236 

649 


415 

5912 

A 

427 

76 

322 

TNSPCYVVFGNSFFS'IIENKKQENK 
VQQAGIRLYGALLTKCPRLYSKQIH 
PALLRRLQHGVDLVYFEDILDKLIG 

nUroU V 

416 

5913 

A 

428 

988 

1223 

RGERADHLRSGIRDQPGQHGETPSL 
LITQKLAGLGSACL* SQLLGRLRQE 
NCLNAGVGGCSEP*SRHCTPAWAT 



ERDS 

A 1 H 

417 

5914 

A 


57 


ERESPFAPRLEGKGANLG* WKAPLP 
GLSPFSGLSLPRTGNYGPPQPPPVNF 
F*F*GETGFPRLTREGLNLRPSENPA 
LVKPQNKVAPKHGVEKPGGK 

418 

5915 

A 

430 

291 

594 

SWLFRLGAMAHAYNNSSLGGQSGR 

IVWAQEFNTQPGQHRGDPGLYK*FF 

FLISQCDGMHLWSQLLRRLRQKDH

LNPRAQGCSEL*LHCCAPAWVTEQ 

DLSQ 

419 

5916 

A 

431 

27 

361 

RGPTVTPQIMAVEDVASTGADPCD 

LDSDGLLHEILTSPLILLLLGLCIFLL 

YLIVR*DQPAANGDSDDD*PSPLPR 

LKRRDFTPDDLRRFYSVQDPRILMD 

FNCKVFDVTK 

420 

5917 

A 

432 

196 

555 

SPSMNPRKKVDLKLIIVGAIGVGKT 

SLLHQYVHKTFYEEYQTTLGASILS 

KN*SYWVDTTLKVTDLGDTGGQER 

FRSMVSTFYKGSDGCILTFDVTDLE 

SFEALEFWPGGGLAQNGPNEA 

421 

5918

A 

433 

1 

685 

EIKYHSLPRLECRGEISAH*NLCLPG 
SSDSPATAS*VAGITGMRHYAQLIFL 
FLVET*FHHVGQGWSRTPDSNDPPA 
SASQGAGDYRRD 

422 

5919 

A 

434 

56 

335 

KCSPKILLTSESTSSNPCLIDTNASDF 
HFLSQVLE*VVSPKGSKEALCCILR 
HLGYETRESCPWCPSQFRYITFDMG 
SYVGPVLHHSCQALSL 

423 

5920 

C 

435 

24 

332 

MKGRTFISLLFLFSSAYSRGVFRRD 

AHKSEVAHRFNDLGEENFRALVLIA 

FAQYLQQRPFEDHVTYYAQLQLFV 

KPMVKWLTAVQNKNLREMNASCN

TXMTTH* 

424 

5921 

A 

436 

130 

599 


425 

5922 

A 

437 

1 

404 


426 

5923 

A 

438 

3 

647 

FSLLSTPHAFGTMKWVTFISLLFLFS 

SAYSRGVFRRDAHKSEVAHRFKDL 

GEENFKALVLIAFAQYLQQCPFEDH 

VKLVNEVTEFAKTCVADESAENCD 

KSLHTLFGDKLCTVATLRETYGEM 

ADCFL\QHKDDNPNLPRLVRPEVDV 

MCTAFHDNEETFLKKYLYEIARRHP 

YFYAPELLFFAKRYKAAFTECCQA 

ADKAACLLPKLDELRDEG 

427 

5924 

A 

439 

323 

899 

MMRVFLSEKALSSSYLEMYLSTPH 

AFGTMKWVTFISLLFLFSSAYSRGV 

FRRDAHKSEVAHRFKDLGEENFKA 

LVLIAFAQYLQQCPFEDHVKLVNE\ 

AKQEPERNECFLQHKDDNPNLPRL 

VRPEVDVMCTAFHDNEETFLKKYL 

YEIARRHPYFYAPELLFFAKRYKAA 

FTECCQAADKAACLLPKLDELR 

428 

5925 

A 

440 

1 

1206 

SFSLLSTPHAFGTMKWVTFISLLFLF 

SSAYSRGVFRRDAHKSEVAHRFKD
LGEENFKALVLIAFAQYLQQCPFED 
HVKLVNEVTEFAKTCVADESAENC 
DKSLHTLFGDKLCTVATLRETYGE 



MADCCAKQEPGRNECFLQHKDDNP 

NLPRLVRPEVDVMCTAFHDNEETF 

LKKYLYEIARRHPYFYAPELLFFAK 

RYKAAFTECCQAADKAACLLPKLD 

ELRDEGKASSAKQRLKCASLQK/PR 

NLGKVGSKCCKHPEAKRMPCAEDY 

LSVVLNQLCVLHEKTPVSDRVTKC 

CTESLVNRRPCFSALEVDETYVPKE 

FNAETFTFHADICTLSEKERQIKKQT 

ALVELVKHKPKATKEQLKAVMDD 

FAAFVEKCCKADDKETCFAEEGKK 

LVAASQAALGL 

429 

5926 

A 

441 

28 

1587 


430 

5927 

A 

442 

1 

1652 

GTMKWVTFISLLFLFSSAYSRGVFR 

RDAHKSEVAHRFKDLGEENFKALV 

LIAFAQYLQQCPLEDHVKLVN\KDD

NPNLPRLVRPEVDVMCTAFHDNEE 

TFLKKYLYEIARRHPYFYAPELLFF 

AKRYKAAFTECCQAADKAACLLPK 

LDELRDEGKASSAKQRLKCASLQK 

FGERAFKAWAVARLSQRFPKAEFA 

EVSKLVTDLTKVHTECCHGDLLEC 

ADDRADLAKYICENQDSISSKLKEC 

CEKPLLEKSHCIAEVENDEMPADLP 

SLAADFVESKDVCKNYAEAKDVFL 

GMFLYEYARRHPDYSVVLLLRLAK 

TYETTLEKCCAAADPHECYAKVFD 

EFKPLVEEPQNLIKQNCELFEQLGE 

YKFQNALLVRYTKKVPQVSTPTLV 

EVSRNLGKVGSKCCKHPEAKRMPC 

AEDYLSVVLNQLCVLHEKTPVSDR 

VTKCCTESLVNRRPCFSALEVDETY 

VPKEFNAETFTFHADICTLSEKERQI 

KKQTALVELVKHKPKATKEQLKAV 

MDDFAAFVEKCCKADDKETCFAEE 

GKKLVAASQAALGL 

431 

5928 

A 

443 

1 

1515 

MKWVTFISLLFLFSSAYSRGVFRRD 

AHKSEVAHRFKDLGEENFKALVLIA

FAQYLQQCPFEDHVKLVNEVTEFA 

KTCVADESAENCDKSLHTLFGDKL 

CTVATLRETYGEMADCCAKQEPER 

NECFLQHKDDNPNLPRLVRPEVDV 

MC/H/YPNAAQNPW*TGDHAFQLW 

KSMKHTFPKSLMLKHSPSMQIYAH 

FLRRRDKSRNKLHLLSL*NTSPRQQ 

KSN*KLLWMISQLL*RSAARLTIRRP 

ALPRRVKNLLLQVKLP*AYSRGVFR

RDAHKSEVAHRFKDLGEENFKALV 

LIAFAQYLQQCPFEDHVKLVNEVTE 

FAKTCVADESAENCDKSLHTLFGD 

KLCTVATLRETYGEMADCCAKQEP 

ERNECFLQHKDDNPNLPRLVRPEV 

DVMCTTKCCTESLVNRRPCFSALEV 

DETYVPKEFNAETFTFHADICTLSE

KERQIKKQTALVELVKHKPKATKE 

QLKAVMDDFAAFVEKCCKADDKE 

TCFAEEGKKLVAASQAALGLTCEA 

CQEPGGLWPPTDAPVSPTTLYVED 



ISEPPLHDFYCSRLLDLVFLLDGSSR 

LSEAEFEVLKAFVVDMMERLRISQK 

WVRVAVVEYHDGSHAYIGLKDRK 

RPSELRRIASQVKYAGSQVASTSEV 

LKYTLFQIFSKIDRPEASRIALLLMA 

SQEPQRMSRNFVRYVQGLKKKKVI 

VIPVGIGPHANLKQIRLIEKQAPENK 

AFVLSSVDELEQQRDEIVSYLCDLA 

PEAPPPTLPPDMAQV 

432 

5929 

A 

444 

2 

1848 

RFSLLSTPHAFGTMKWVTFISLLFLF 

SSAYSRGVFRRDAHKSEVAHRFKD 

LGEENFKALVLIAFAQYLQQCPFED 

HVKLVNEVTEFAKTCVADESAENC 

DKSLHTLFGDKLCTVATLRETYGE 

MADCCAKQEPERNECFLQHKDDNP 

NLPRLVRPEVDVMCTAFHDNEETF 

LKKYLYEIARRHPYFYYAPELLFFAK 

RYKAAFTE\CCQAADKAACLLPKL 

DELRE*LNLQKHVLLMSQLKIVTNH 

FIPFLETNYAQLQLFVKPMVKWLTA 

VQNKNLREMNASCNTKMTTQTSPD 

W*DQRLM*CALLFMTMKRHF*KNT 

YMKLPEDILTFMPRNSFSLLKGIKLL 

LQNVAKLLIKLPACCPKLDELRDEG 

KASSAKQRLKCASLQKFGERAFKA 

WAVARLSQRFPKAEFAEVSKLVTD 

LTKVHTECCHGDLLECADDRADLA 

KYICENQDSISSKLKECCEKPLLEKS

HCIAEVENDEMPADLPSLAADFVES 

KDVCKNYAEAKDVFLGMFLYEYA 

RRHPDYSVVLLLRLAKTYETTLEKC 

CAAADPHECYAKVFDEFKPLVEEP 

QNLIKQNCELFEQLGEYKFQNALLV 

RYTKKVPQVSTPTLVEVSRNLGKLP 

SC**SC\CLLPKLDELRDEGKASSAK 

QRLKCASLQKFGERAFKAWAVARL 

SQRFPKAEFAEVSKLVTDLTKVHTE 

CCHGDLLECADDRADLAKYICENQ 

DSISSKLKECCEKPLLEKSHCIAEVE 

NDEMPADLPSLAADFVESKDVCKN 

YAEAKDVFLGMFLYEYARRHPDYS 

VVLLLRLAKTYETTLEKCCAAADP 

HECYAKVFDEFKPLVEEPQNLIKQN 

CELFEQLGEYKFQNALLVRYTKKV 

PQVSTPTLVEVSRNLGKLPSC 

433 

5930 

A 

445 

1 

3780 

MKWVTFISLLFLFSSAYSRGVFRRD 

AHKSEVAHRFKDLGEENFKALVLIA 

FAQYLQQCPFEDHVKLVNEVTEFA 

KTCVADESAENCDKSLHTLFGDKL 

CTVATLRETYGEMADCCAKQEPER

NECFLQH/KCFLQHKDDNPNLPRLV 

RPEVDVMCTAFHDNEETFLKKYLY 

EIARRHPYFYAPELLFFAKRYKAAF 

TECCQAADKAACLLPK\LDELRDE\ 

GKASSAKQRLKCASLQKFGERAFK

AWAVARLSQRFPKAEFAEVSKLVT 

DLTKVHTECCHGDLLECADDRADL 

AKYICENQDSISSKLKECCEKPLLEK 



SHCIAEVENDEMPADLPSLAADFVE 

SKDVCKNYAEAKDVFLGMFLYEY 

ARRHPDYSVVLLLRLAKTYETTLEK 

CCAAADPHECYAKVFDEFKPLVEE 

PQNLIKQNCELFEQLGEYKFQNALL 

VRYTKKVPQVSTPTLVEVSRNLGK 

VGSKCCKHPEAKRMPCAEDYLSVV 

LNQLCVLHEKTPVSDRVTKCCTESL 

VNRRPCFSALEVDETYVPKEFNAET 

FTFHADICTLSEKERQIKKQTALVEL 

VKHKPKATKEQLKAVMDDFAAFV 

EKCCKADDKETCFAEEGKKLVAAS 

QAALGLTPLGPASSLPQSFLLKCLE 

QVRKIQGDGAALQEKLCATYKLCH 

PEELVLLGHSLGIPWAPLSSCPSQAL

QLAGCLSQLHSGLFLYQGLLQALE 

GISPELGPTLDTLQLDVADFATTIW 

QQMEELGMAPALQPTQGAMPAFAS 

AFQRRAGGVLVASHLQSFLEVSYR 

VLRHLAQP 


434 


5931 


446 


2255 


STPHAFGTMKWVTFISLLFLFSSAYS 

RGVFRRDAHKSEVAHRFKDLGEEN 

FKALVLIAFAQYLQQCPFEDHVKLV 

NEVTEFAKTCVADESAENCDKSLH 

TLFGDKLCTVATLRETYGEMADCC 

AKQEPERNECGTMKWVTFISLLFLF 

SSAYSRGVFRRDAHKSEVAHRFKD 

LGEENFKALVLIAFAQYLQQCPFED

HVKLVNEVTEFAKTCVADESAENC 

DKSLHTLFGDKLCTVATLRETYGE 

MADCCAKQEPERNES/CFCNHKKD 

NPNLPRLWRPEVDVMCYTAFHDNE 

ETVFLKKYLYENCPERHPLPFMAPG 

NSFSF\AKRYKAAFTECC\QAADKA 

ACL/LCPKLDELRG*KGRLRSAKQR 

LKCASLQKFGERAFKAWAVARLSQ 

RFPKAEFAEVSKLVTDLTKVHTECC 

HGDLLECADDRADLAKYICENQDSI 

SSKLKECCEKPLLEKSHCIAEVEND 

EMPADLPSLAADFVESKDVCKNYA 

EAKDVFLGMFLYEYARRHPDYSVV 

LLLRLAKTYETTLEKCCAAADPHEC 

YAKVFDEFKPLVEEPQNLIKQNCEL 

FEQLGEYKFQNALLVRYTKKVPQV 

STFTLVEVSRNLGKVGSKCCKHPG 

AKRMPCAEDYLSWLNQLCVLHEK 

TPVSDRVTKCCTESLVNRRPCFSAL 

EVDETYVPKEFNAETFTFHADICTL 

SEKERQIKKQTALVELVKHKPKAT 

KEQLKAVMDDFAAFVEKCCKADD 

KETCFAEEGKKLVAASQAALGL 


435 


5932 


447 


477 


FYNRVLLLLPRLEC*GVIFPHRNLHL 

PGSSDSHALAFRVTGITGTCHHACLI

FVLLVETRFLHVGQAGLELLTSSDP 

PSSASQSSGITGVGHCAGPTAHFLP 

HKVLRLSTKLPSGMSPETIHPRRHA 

EKSCLFSFSLYLFHLTSSCSFIHPFSIL

TFKC 





436

5933

C

448

141 

390 

MAKFSLCPPVKERGEKAHWEXXX 
XXNKATNSICEVSTFMXXXXXXXX
XXXXXXXXXLNIHYESDWVISKLIP 
GCIKMTEAITC* 

437 

5934 

A 

450 

345 

462 

NQRSTARGKELLQDTRALKKNS*R 
VIKYSKQQAQTCEG 

438 

5935 

A 

451 

1538 

1709 

SKCKLKQDPSHAGTSLQSQLLRRLR 

QENPLRPGFQGCSEL*SYHCTPARV 

TEQDPIS 

439 

5936 

A 

452 

243 

353 

YSYHIRVHVHTHPHLHACP*LHTVR
YT*NSTHTHTYF 

440 

5937 

A 

453 

2 

366 

SLPASDRPPISSPLATSGTIFSAISCF 

WDLPAPFLWLAPSCQPTMSSQIRQN

YSTDVEAAVNSLVNLYLQASYTYL 

SNLQDIKKPAEDEWGKTPDAMKAA 

MALEKKLNQALLDLHALGSART 

441 

5938 

A 

454 

2 

797 

LIGKFAPRGPRIRQRRGGPARVWSL 
CFKQVFGTEQDPGILFPASGPPSDFL 
LRLQTSGTIFSAISCFLGPAQHRFLW 
LAPSCQPTMSSQIRQ\NYST\DVEAA 
VNSLVNLYLQASYTYLSLGFYFDR\ 
DDVALEGVSHFFRELAEE\KRKGYE 
RLLK\MQNQ\RGG\RALFQDIKKP\A 

QAL/LWDLHALG\SARTDPHLCDFL 
ETHFLDEEVKLIKKMGDHLTNLHR 
LGGPEAGLGEYLFERLTLKHD 

442 

5939 

A 

455 

2 

331 

FFVFCFGKRGLAVFRVEGKGMNPG 
*RNT Wl PGI KNFSGI Tl WRGGNNK 

lA.ll JL* VV JLrl VJ JL-» IV INI OVJu 1 JLt TV 1 vVJ VJ i N 1 N IV. 

PGPPLQPKFGFLKKKGFSPGGQGGF 
KIPNLEIGPNKGPKGWE*RA*PPNPS 
PSNFFNKPWVG 

443 

5940 

A 

456 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAFMGNPKVKAHGKKVLTSLGDAI

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP

EVQASWQKMVTGVASALSSRYH 

444 

5941 

A 

457 

38 

533 

APSPDAVMGHFTEEDKATITSLWGK

VNVE\DAGGETLGRLLWYPWTQR 

FFDSFGNLSSASAI\MGNPVKVKAHG 

KKVLT\SLGDAIK\HLDDLKG\TFAQ

A* SEPAPVTKLVH VDP\ENFKAPGEM 

LLVTR/VLAIPFSAKEFHP*RLQASW 

AE/MMGDLOLASALVPSRYH 

445 

5942 

A 

460 

3 

198 

GIPGSSFCGLCGDVPGKPV*RADGS 

C*DGVAPRLLRPRGFRGGRCGPVLD 

SLAGQRGAESGCRG 

446 

5943 

A 

461 

649 

1185 

ETCLAFMYQRTCSADSKRYIWQLF 
LEKGPMGYHPLHF*VFLGFFFFFFET 
VLAVLPQAGSVGGHNHSSIASSNHP 
RA*ANPPHLVAGDYKLTAQPGLKF/ 
VFI 1 FTGFSYVPPGWVRGST GSNGP 

V * X^fJ-fU 1 VJ 1 O I V V^I vJ VV V OVJtJljVJi*JtN VJ17 

PAPAFQRHRAKFVSFVPCHHAQQK 

GSIPFNELTFINWVMLGGASSLSWEI 

VNSS 

447 

5944 

A 

462 

1 

298 

NKEILARPNGSSPEFPPLWGLRQVD 



PPESGVQSPASPHGKTLFLLKKPTLT 
GQGGPNPVFPVLRRVKPQGPLNPG 
Ouur H'FKSCPCPPE W G AKLDPVF 

448 

5945 

A 

463 

179 

351 

RHVGIKHGDHEATEKFIDEFAKVIA 
DKHLTLEQVYNANETSLF*HYYPR 
KTPITAAE 

449 

5946 

A 

464 

1 

327 

PGVPMQRAEFEQPYKRSRCDDSPRT 

PSNTPSAEADWAPGLELHPDYKTW 

GPEHGCSFLRRGGFDKPVLLKNIRE 

NEITGALLACPDESSFENLGVSVLR* 

T'KLLNYYS 

450 

5947 

A 

465 

261 

452 

GDLRVTGAPSVSLSP*LGLP*VSRP* 
VPSPLASGTSKPLARFPEEAVGFSRP 
GLCLLISFPGL 

451 

5948 

A 

466 

362 

991 

PSRHLSWLWGSTGCRNAHVQLAG 

GAGARAGEERPCFPRPELAGTVSPG 

DKSLRQFGEKGGGGHERMQGPHHS 

SKESGGQSHGEDPSLEASPPKPESPA 

SQVPMKSPPVIPGETAHGLP*VSRP* 

VPSPLASGTSKPLARFPEEAVGFSRP 

GLWSAMQAGVCDQGICAIRNSPQT 

TQGGRRP*ERRCRYMHVTTEKAAF 

TPSAPRECLPH 

452 

5949 

A 

467 

24 

436 

RFIVLVHYISAPGELCRGWGSPKME 

GWGKRTSCQSLPKAGRSPGSLSRTD 

EYCGHRLPDNV*ATGGGQGPPAPG 

MGVRNPSPAPRTSPGWRVPSNTAP 

QLLGCFGGQTGRVPFIQPDPSSSSG

MRNSPPGRGCLESA 

453 

5950 

A 

468 

2 

424 


454 

5951 

A 

469 

3 

452 


455 

5952 

A 

470 

2 

467 

PDSSGPHRLRENPPWCLSPADKTNV 

KAAWGKVGAHVGEYGAEALERMF 

LSFPTTKTYFPHFDLSHGSAQVAKGH 

G\KKVADALTNAVAHVDDMPNALS 

ALSDLHAHKLRVDPVNFKLLNSHCL 

LVTLXAAHLPAEFTPCGGTASLVDKF 

LGFLLKQRC 

456 

5953 

A 

471 

61 

346 

VRARVPSPAAAMGCTLSAEDKAAV 

ERNKKIDRNLREDREKAAKEVKLL 

VLGAGESGKSAIGKPMEIIHEEGYIQ 

DEWKPFKGIVYSNTLQAIIGT*KAA

VERNKKIDRNLREDREKAAKEVKL 

LVLGAGESGKSAIGKPMEIIHEEGYI 

QDEWKPFKGIVYSNTLQAIIGT 


457

5954

A

472

828 

1066 

QAQWLTPCNAQHFARPRRANHLRL 
GV* HQTGQHGKTPSLLKEKYKKKK 
KVASRSHMSVIPTMWKAEAQELLE 
PGRQRSQ 

458 

5955 

A 

473 

180 

350 

EPMAKGKTESPGPKRCGP*I*WVIS 

QRGTLRFRGAGLFFMGEFLRLGENL 

LEIPRGA 

459 

5956 

A 

474 

1689 

1856 

GRCHITCVKHSHGAADFDTTFILFY
rlLr YrlLr Ir * 1 ESCSVTQAGVQRGN 
LGSL 

460 

5957 

A 

475 

115 

324 

SNFQLSRKLYF*FFQGKSKHNEYFII 
FE*T*ILHFLNLGIVIYNYGTSFRKNR 



MKRKWVNDKMGQQQKHG 

461 

5958 

A 

476 

310 

633 

RFSLGEQECEVCYRLRPTPGWTPGE 

TAGVAGREPLVCSPPPPPPASPCAPP 

KVRSDMGPQCPCAS*WPSGLTKGP 

SCPFPVASHGGITPGQWPGEEETSR 

KERSSATK 

462 

5959 

A 

477 

2 

293 

PAAERSCLRVTFASACPASMEPKRI 
REGYLVKKGSVFNTWKPMWWLIE 
DGIELYKKNCDNSP*GMIPLRGITLT
RPWLDFGRRKCWFTKSSIYQYL 

463 

5960 

A 

478 

387 

511 

WDIPIFISDIYIILITGYLTTY*NVLH 
WKKIIYFYIALIVL 

464 

5961 

A 

479 

130 

240 

KNEQDPRDL*DNDKWPNIHVIGVPE 
EDKDNGTERVFD 

465 

5962 

A 

480 

116 

423 

GIRCPGPREASLLSQFILSMRQAGQ 

DWQPEAYTLRICQLEVFSTCVSSLL 

HPVCRSQ*LPMEPEVIPGWNGKPRG 

HWPVQIFKSFTHGTPNLAGPGCCCG 

VR 

466 

5963 

A 

481 

64 

343 

QLL**LSSTWEGLQAAKELDEQRGI 
GC 

467 

5964 

A 

482 

61 

342 

QPQTDTMGHLTPEEKSAVTDLWGK 
VNADEADGEALVTLLGVYPWTQR 
MFESFGDLDTPEADMGNPKVKAHG 
WKVL*AFIDGPAHPDQLKGNLCT 

468

5965 

A 

483 

557 

816 

SRHFERPWVDHLRLGV*DQPGQHG 
ETPSLQKIQKLARSGGTHL*SSYLG 
G*SGKNHLNPGSQGCSEP*SCHCTP 
GWVTEQNSVSKK

469 

5966 

A 

485 

277 

322 

FFF*VYHVWFLFSFLICRFMPFAKFG 
NF*PLFLEIFFHPYSFSSL*YEW*SFC 
YCLRGLLCFHVYPLFLVYFSLFFILV 
NFC*LFFSSLILFFCHMQSTVELVQ 

470 

5967 

A 

486 

31 

309 

FLELGPGKPFGNMYDADDDMQYD 
EDDDEITPDLLQETCWIVIRSYFDKK 
G*VIQQLDSFD*SIHMTALRIGEYAA 
PIDLQADAHHASGEGEKP 

471 

5968 

A 

487 

130 

521 

KAKFRTFCFTSSFYN*DLDFKIYPSPI 

KVAEPS*LSGQCFSSLFFHQDLGFCF 

VLLFETESCSVTQVEHSGAISAHCN 

LRLPG*SNSPVSVSLAAGTTGTHHY

TQLIFVLVAEMGFCHVGQSGLELAS 

CR 

472 

5969 

A 

488 

32 

452 


473 

5970 

A 

489 

38 

525 

APSPDAMGHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSASAIMGNPKVKAHGK 

KVLTSLGDAIKHLDDLKGTFAQLS

ELH\CDKLHVDPENFKLLGVNVLVT 

VLAIHFGQRIHP*RCRASWAEDG*L 

GVASALVLQDTTELTCP 

474 

5971 

A 

490 

818 

947 

VCFLFLFF*DGVSLMLPRLECNGTIS 
AHRNLCFPGSSDSPVSA 

475 

5972 

A 

491 

17 

416 

PPSSNPMGHFT*EDTATITSLWGTV 
NAENAGGKTLLRLLGAYPWTQRLF 
DSFGNLSSASAIMGNPQGKAHGLK 
VLTLL*DAVKHLDDLMGTFSHPTEL 



PCYKLHLDSENLKLLGYVLAIVMAI 
HFGKEVIPAV 

476 

5973 

A 

492 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

477 

5974 

A 

493 

34 

548 

APSPDA\MGHFTEEDKATITSLWGK\ 

VNVEXDAGGETLGRLLWYPWTQR 

FFDSFGNLSSASAI\MGNP\KVKAHG 

KKVLTSLGDAIKHLDDLKGTFAQIA 

SELH\CDK\LHVDPENFKLLG/NMLL 

VTRFGQSHFRAKNFTPEGCRASWQ 

KQKMAEDGDLQWPVPCSSRIPLKP

LGP 

478 

5975 

A 

494 

527 

1022 

GWASAFLWLIKPGSPRGYRCNPHH 

VILPVSAGLELPLCSLLPSTDTCPAS 

QTGSGRANRATPGCGRPAGVRKGR 

PACKRSKNFRAACGSGARSRPGHR 

TPGSSRPPGRQKRAPWASQARRPPA 

*SRPGGRGGAARPHPRRTGAPAGSA

RGAQRSERARPQPRDPA 

479 

5976 

A 

495 

2 

379 


480 

5977 

A 

496 

3 

723 

VPRVCLLLQQCLDGTDPGTGLPASD 

RPPISSPLATSGTIFSAISCFWDLPAP 

FLWLAPSCQPTMSSQIRQNYSTDVE 

AAVNSLVNLYLQASYTYLSLGFYF 

DRDDVALEGVSHFFRELAEEK\REG 

YERLL\RMQNQ\RGGRALFQDIKKP 

AEDEWGKTPDAMKAAMALEKKLN 

QALLDLHALGSARTDPHLCDFLETH 

FLDEEVKLIKKMGDHLTNLHRLGG

PEAGLGEYLFERLTLKHD 

481 

5978 

A 

497 

1 

196 

GTSVTKMEAFLGSRSGLWAGGPAP 

GQFYRITFTPDSFMDPASALYRGPIT 

RTQNPMVTGTSVLGV*IEGGWVIA 

GHMLGFYVCLDRLRDFYRFTRVNL 

STVLDASGDFAE*HYL*QFYRITFTP 

DSFMDPASALYRGPITRTQNPMVTG 

TSVLGV 

482 

5979 

A 

498 

1 

401 

GTRKWVTFISLLFLFSSAYSRGVFR

RDAHKSEVAHRFKDLG*ENFKALV 

VIAFAQYLQQCPFEDHVKLVNEVTE

FAKTCVADESPDN*D*SLHTLFGDK 

LCTVAILPETYGEMADCCVQLEPER 

NECFLQLKD 

483 

5980 

A 

499 

47 

411 


484 

5981 

A 

500 

316 

493 

LLVGRLALPEGDRHDQHQIQGLEQS 
ILKLEKEIQDLENAELQISTKEEAIL* 
KLKAIER 

485 

5982 

A 

501 

27 

526 

LSLTSRMEEAELVKGRLQAITDKRK

IQEEISQKRLKIEEDKLKHQHLKKK

ALREKWLLDGISSGKEQEEMKKQN

Q\QDQHQIQVLEQSILRLEKEIQDLE

KAELQISTKEEAILKKLKSIERTTEDI

IRSVKVEREERAEESIEDIYANIPDLP


WO 01/79449 PCT/US01/08656 


SEQ ID 
NO: of 
nucleotide 
sequence 

SEQ ID 
NO: of 
peptide 
sequence 

Method 

SEQ ID NO: 
in USSN 
09/770,160 

Nucleotide 
location of 
first codon 
for peptide 
sequence 

Nucleotide 
location of last 
codon for last 
amino acid of 
peptide 
sequence 

Amino acid sequence (X=Unknown; *=Stop 
codon; /=possible nucleotide deletion; \=possible 
nucleotide insertion) 







KSY1PSRLRKEIN 

486 

5983 

A 

502 

25 

208 

VSRIEAVSGSHGFS1HKLLTVNVITY 

DCVSSWCLYVSFQQKDPLVLGQRQ 

LKSKPAGDLNT*GKVIKCKAAIAW 

KAGKPLCIEEVEVALPKAHEARIQV 

SRWFRLELSLA 

487 

5984 

A 

503 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

488 

5985 

A 

504 

52 

562 

APSPDAMGHFT\* EDKAT1TSL WGK 

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSASA1MGNPKVKAHGK 

KVLTSLGDAI\KHLDDLKGTFAQLS 

ELHCDKLHVDPENFKLLGNVLVTV 

LAIHF\GKEFTPEVQASWQKMAED\ 

VTGVASALCFTKHLDFMCMMQSFQ 

R 

489 

5986 

A 

505 

801 

927 


490 

5987 

A 

506 

659 

837 

RKIKEAGHRGSQLYSQHFGRLRQE 
DCLSPGGQGCSEPRLHRCVPAWVT 
G*KKTLPKNKQ 

491 

5988 

A 

507 

3 

203 


492 

5989 

A 

508 

23 

678 

RPRVRMAEVQVLVLDGR\GHL\LGR 
LAA/LSVAKQVLLGRKVVVVRCEGI 
NISGNFYRNKLKYLAFLRKRMNTN 
PSRGPYNFRAPSRIFW\RTVRGMLP 
HKTKRGQAALDNRLKVFDGMPPPY 
D/KAPLFL*QKKRMVVPAALKVVR 
LKPTRKF\AYLGRLA\DEVGWKYQA 
" VTAT\LEEKRKEKAK\IHYRKKK*L\ 
MRLRKQ\AERNVRRIFANTPEVLKT 
HGLLV 

493 

5990 

C 

509 

275 

370 

MPQGGACSPVLPGSLVVSLLLTQSY 
LVWPQW* 

494 

5991 

B 

510 

1 

1122 

MVFLSGNASDSSNCTQPPAPVNISK 

AILLGVILGGLILFGVLGNILVILSVA 

CHRHLHSVTHYYIVNLAVADLLLTS 

TVLPFSAIFEVLGYWAFGRVFCNIW 

AAVDVLCCTASIMGLC11SIDRYIGV 

SYPLRYPTIVTQRRGLMALLCVWA 

LSLVISIGPLFGWRQPAPEDETICQIN 

EEPGYVLFSALGSFYLPLAIILVMYC 

RVYVVAKRESRGLKSGLKTDKSDS 

EQVTLRIHRKNAPAGGSGMASAKT 

KTHFSVRLLKFSREKKAAKTLGIVV 

GCFVLCWLPFFLVMPIGSFFPDFKPS 

ETVFKIVFWLGYLNSCINPIIYPCSSQ 

EFKKAFQNVLPJQCLRRKQSSKHAL 

GYTLHPPSQAVEGQHKDM* 

495 

5992 

A 

511 

928 

1311 

AMIVPTAVQPGRQSKDPVSKEKKE 

KARKERWLGTVAHSCNPRTLGGQG 

GWIMRSRDRDHPGQQGETPSLLKM 

QKLAGRGGGHQSRLLGRLRQENGV 

NPGGGACSEPRWHCCTPAWATE*D 


SISNNNKK 

496 

5993 

A 

512 

23 

288 

APSPDAMGHFAEEDKAT1TSLWGK 
VN VEDAG GETLGRLL WNPG TLKL 
NSSLG*Q*FGGCILSPHHCLGKGRK 
CFFSIVEMLVILYFM 

497 

5994 

A 

513 

20 

207 

LDAGTACAETMACTSRLYGLPRST 
WPNHPDAILPEGYFSSEI*SRPDCGL 
RVIYRGLTISSA 

498 

5995 

A 

514 

228 

375 

CVALGAMRGMRRLPAGAPKMLMG 

V**ELDRLGYIAHPQLGKRARAGIV 

L 

499 

5996 

A 

515 

417 

573 

ETPTGLRGGTCL*S*LPRRLRWENC 
LNPGGRGCSEPRSHHCTPAWATEQ 
DS 

500 

5997 

A 

516 

173 

420 

LLLANQLMSLQIRQNYSTDLEAAV 
NRLGNLDLQAYYTYLYLGFYYDRD 
DEGLEGVSHFFRELAEDKRDRY*RL 
LTMQNQRGG 

501 

5998 

A 

517 

3 

415 

HEGHQYAPNPDAMGHFTEEDKATI 

TSLWIKVNEENAG*ETLARLLAGYP 

WTQRJFDRFGNLFFASDIMGNSPVQ 

AHGKNVLTSLLDATKHLDDLKGTF 

AQLSELHCYKLHVDPENFHALANE 

LATALAMHFR**FTP 

502 

5999 

A 

518 

3 

232 


503 

6000 

A 

519 

1 

2361 


504 

6001 

A 

520 

4806 

5788 

HTLFGDKLCTVATLRETYGEMADC 

CAKQEPERNECFLQHKDDNPNLPR 

LVRPEVDVMCTAFHDNGETFLKK* 

VIRCL*FKIKKHGVTP*ANTL*KLP* 

QKYFQH*DLEVLL**FFKEVVFDTT 

KFYTAKNMIKD1LKFIETGYNLSQK 

FKIDKFFNVFRRYVYMWIIDFVLV 

SNHLPKFNHLCTHTHTHTHLTLFST 

YLKNDRDKTIMCKLSLIG*L\ESLEF 

GGSGENVDYNYFCNIVCYRK/ADCF 

SFLKFRYLYEIARRHPYFYAPELLFF 

AKRYKAAFIECCQAADKAACLLPK 

VLCTRIEKKSLLSNLILSILWLDLGT 

LSV 

505 

6002 

A 

521 

151 

364 

VTHDCICYLQQTHF*PKDKNRLKLK 

RCKKQFHENSNQKRVEVALLISAQ 

RDLRSKIDTEGKSIQQRKKSSC 

506 

6003 

A 

522 

925 

1168 

SQHFGRPRWVDHLRSGIGDQPGQH 
GETPALLKIQKLARCGYMRL*SLRR 
LRRENHLNPGGGGCSETRLHHCIPA 
WATEQDS 

507 

6004 

A 

523 

142 

329 

THSLFLLWSLSHHSPTVNTTLRNLG 
ALHRRHGKL*AAETLDVFNLTSSCS 
LLFNPFYRNFVR 

508 

6005 

A 

524 

108 

283 

KQNLILSPRLKCNGPISVN*NFNLPG 
LTRSQA*ASREAGTTGTCYHA**IG* 
IFIIDG 


509 

6006 

A 

525 

1 

345 

GTRAAPLRIQSDWAQALRKDEGEA 
WLSCHPPGKPSLYGSLTCHGIVLYG 
IP*ATSSHRFIANDPNIITSHSSRPTVF 
VPSSFSSLILFFLAHPLSISLPFFSLPA 


FPLNFLPLRS 

510 

6007 

A 

526 

3 

276 

HEPRRPQYSSGRRAAWLSYSLFSAG 
CGASAPRPLVMSDSGSYGKSDVEH 
LYYRNY*STRI*GYIQTSHI*SG*GM 
TTDSYYGINIFYKLQ 

511 

6008 

A 

527 

2297 

2435 

LKLVSKKRVYNF1LILLML*TYFLK 
DGLFECLWHLTCKKKKLQKNP 

512 

6009 

A 

528 

123 

317 

QETKKEQNKENKQIK*RSTRKKHR 
QGTNKTKERGERQTPPVGNRQTPT 
LGIHARPRRRATTSPRA 

513 

6010 

A 

529 

787 

1069 

FASHFGRLRQADPLRSGVQDQPGQ 
QGETPSLLKIQKFPRRDGGRL*SQLP 
RKLRQENCFNRGGGDCSEPRLCPFL 
PAWATERNSVKGKERKEKK 

514 

6011 

A 

530 

110 

369 

CWLSCCLEVRSCLYTFLSAYNFKCV 
LTI*HTFFVFFWSLCVYYFFIVLCCL 
VLVWCLSSLYYGIIVYYLYFCYSLFI 
VLGYGILAV 

515 

6012 

A 

531 

268 

331 

QM*TAKCARCEGLGLITLCLDCIVA 
NTLLLVPNGETSWTNTNHLTLQVW 
LKDGYIGWGLMALCTGIAPVLAGG 
KDCCGARRCGNR*QMLRYDFS*AL 
VVLGA1YWLS 

516 

6013 

A 

532 

807 

1060 

S WHFGRLR WADYLRPGAGDQLSQ 
HGEISSLLKTQKLPGCGDTHL* SQLL 
GRLRQENHLNLGGGGCSEPRSHHC 
TSAWVTERDSV 

517 

6014 

A 

533 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

518 

6015 

A 

534 

38 

550 

APSPDAMGHFTEEDKATITSLWGK 
VNVEDAGGETLGRLLVVYPWTQRF 
\FDSFGNLSSA\SAIMGNPKVKAHGK 
KVLTSLG\DAIK\HLE* SQGAPFAQA 
* SELH\CDKPALLDPGGTFKLPGEN V 
AGLTVFGQSHFRAKEFHP*RLQAS 
WHKQKMAEDGDLELASALVPSRY 
H 

519 

6016 

A 

535 

2 

348 

ARAGAGRLRRAASALRLLSPRLPVR 

ELSSLARLYPHRVDDHYENPTNAGS 

LD*TSKNVGTGLQLAPA*GDWKL 

QTLVDEKVKNVDARFKTLGCGSAI 

AYSSLATEWVTGKTADE 

520 

6017 

A 

536 

385 

536 

RMSAGALFIGYCIYFDHKRRSDPNF 
KNRL*DGRKKQKLAKERAGLSKLP 
D 

521 

6018 

A 

537 

123 

705 

AAPTALRVRGPPLLRGPCRHRPRSA 

FVEKMVGRNSAIAAGVCGALFIGY\ 

CIYFDPQKTK*TPTFKNRLRERRK\K 

QNLCQRRELGL\SKLPD\LKDAESCC 

RKFFL*RNTSLGEELLSFDG*/YEY*E 

RAVDHLDKLPIAV\CGQ\PQQ\LLQV 

LQQTL\PPPVF\QMLLTKLPTISQRIV 

SAQSLAENDDVGMRNKCLH 


522 

6019 

A 

538 

1 

430 


523 

6020 

A 

539 

42 

373 


524 

6021 

A 

540 

1 

430 

QQLQRLVHPDFFSQRSQTEKDFSEK 

HSTLVNDAYKTLLAPLSRGLYLVS* 

SS/Y GIEIPERTDYEMDRQFL1EIMEI 

NEKLAEAESEAAMKEIESIVKAKQK 

EFTDNVSSAFEQDDFEEAKEILTKM 

RYFSNIEEKIKLKKIPL 

525 

6022 

A 

541 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

526 

6023 

A 

542 

38 

547 

APSPDAMGHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLVVYPWTQ\R 

FFDSFGNLSSASAIMGNPKVKAHGK 

KVLTSLGD\AIKHL\DDLKG\TFAQA 

*SELH\CDKAALLDPENF\KLPGGNV 

AG*PVFGQSHFRAKEFHPWRLQGFP 

GISRRWQKMVTWSWPVPCSSRYH 

527 

6024 

A 

543 

328 

495 

NLGANNCSLLGIGLLKGSMSGRLW 

PKAFSAG*KQGLQNQRKHTALVKIE 

DVDA*GE 

528 

6025 

A 

544 

154 

340 

PGLLKAAIWGIAYLRATYWTYVLA 
DLHPFADMLHAGYSITSEVEQPVLA 
VQLTYNPDES*WP 

529 

6026 

A 

545 

124 

323 

EVKSVYLVYILSNRFF*CTYMHILV 
YYVYFIGLTI*LEEHSMLVYQNLVH 
YFLVFVNVGIYLLYLV 

530 

6027 

A 

546 

314 

445 

SPILLQFTWLTRYLFTKIQFI1YFFET 
ESCSIAQARV*WCDLG 

531 

6028 

B 

547 

1 

1011 

MDLKFNNSRKYISITVPSKTQTMSP 

HIKSVDDWVLGMNLSKFNKLTQF 

FICVAGVFVFYLIYGYLQELIFSVEG 

FKSCGWYLTLVQFAFYSIFGLIELQL 

IQDKRRRIPGKTYMIIAFLTVGTMG 

LSNTSLGYLNYPTQVIFKCCKLIPV 

MLGGVFIQGKRYNVADVSAA1CMS 

LGLIWFTLADSTTAPNFNLRVLYSY 

SIGFVYILLGLTCTSGLGPAVTFCAK 

NPVRTYGYAFLFSLTGYFGISFVLA 

LIKIFGALIAVTVTTGRKAMTIVLSFI 

FFAKPFTFQYVWSGLLWLGIFLNV 

YSKNMDKIRLPSLYDLINKSVEARK 

SRTLAQTV* 

532 

6029 

A 

548 

244 

1408 

SRHNGMDLTQQAKDIQN1TVQETN 

KNNSESIECSKITMDLKFNNSRKYIS 

ITVPSKTQTMSPHIKSV*RWVLGM 

NLSKFNKLTQFFICVAGVFVFYLIY 

GYLQELIFSVEGFKSCG\WYLTLVQ 

FAFYSIFGLIELQLIQDKRRRIPGKTY 

MIIAFLTVG\TMGLSNTSLGYLNYPT 

Q VIFKCCKLIP VMLGG VFIQGKRYN 

VADVSAAICMSLGLIWFTLADSTTA 

PNFNLTGVVLISLALCADAVIGNVQ 

EKAMKLHNASNSEMVLYSYSIGFV 


YILLGLTCTSGLGPAVTFCAKNPVR 

TYGYAFLFSLTGYFGISFVLALIKIF 

GAL1AVTVTTGRKAMTIVLSFIFFAK 

PFTFQYVWSGLLVVLGIFLMFTAKI 

WDKIRLPSLV 

533 

6030 

A 

549 

66 

346 

IQQLPTFFHIFS1FFLIR*FFYMKGFR* 
LVLFIYCPHVYA* SYFSLLFFCSLTI* 
FISFSLYFTLFLLFFTFLLFICVLAMFI 
FFELHLSYIP 

534 

6031 

A 

550 

21 

337 

GPEAQCPDQPPPWLSFQGLPQGTT 

WATHSAPCSPNLTSRSWCPDSEPGR 

AGGRGRPPTLDHDAPPTTPL*PSKP 

HPCIPQALPSSRTLRPLYATPRQHAA 

TQCTP 

535 

6032 

A 

551 

526 

771 

PPPLGVPGTLQFLRPRAAVLIGSKLL 
RPGRFCRWIFSPLLLVNISWLGTVV 
HACNPSTLGDQGGRIT*G*EFETSLP 
TWRNS 

536 

6033 

A 

552 

305 

569 

KKPLKGEKGGSLKTRPSFKKPDAKI 
YLKKSVGFL*TNPEQFKKEIRNTIPLI 
KGASSSSSSKTNLGINLTKVVKDLN 
NENSRTLLRQS 

537 

6034 

A 

553 

90 

339 

EVSALPDLPAVMLAGPTP*PSFPRTP 
SYFSAPPLLLPLSCSFPLLPLPMPHSC 
PPSSSPSPPSLLLLSITPSPAPSPFPLLF 
P 

538 

6035 

A 

554 

1179 

1408 

GYPVGKRRLGERQGPRQPPTLLPCD 

KEAERGEHIYIYFIYILYI*YIYNIYII 

YYIYNIYIHIYYIIYIHTYIIYI 

539 

6036 

A 

555 

722 

991 

SQHFWRPRQVNHVSLGVQDQHGQ 
HSENPVSTKIYIYIQKLARCSDRCL* 
S*LLRRLRHENHLNLGGGGCSELKS 
CHCTPAWATE*DPVSK 

540 

6037 

A 

556 

1 

362 

GTSRQVCREHSFQSVKLSAGARSW 

CFLSHWDPAGEVSLTDCSEIFLPFLG 

MAAVYHYFSINIFFKTSFFRLILIY** 

SYFHLYFLYYS1LCLFILLLFIIFYYC 

YILFISNLFTIIFLFL 

541 

6038 

A 

557 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

542 

6039 

A 

558 

38 

497 

APSPDAMGHFTEEDKATIT\SLWGK\ 

VNVEDAGGETLGRLLVVYPWTQRF 

FDSFGNLSSASAIMGNPKVKAHGK 

KVLT\SLGDAIK\HL\DDLKG\TFAQA 

*SELALVDKLACGILENFKAPGEML 

LVTRFWQSHFRQKNFTPEGCKASW 

AERWVTW 

543 

6040 

A 

559 

1 

414 

FETVSLLLLRLEHTGTISTHCNLRLP 

GSNDSAASAS*VAGTTSVCHHTGLI 

SVFSIETEFHHVGQTGLELLTSSDPL 

TSASPGAGIKGGSHCAQSPICFRGN 

NEMNYQATGIYSKSEIFFCLGYVTM 

SRCLTSQGSGS 


544 

6041 

A 

560 

178 

334 

NVCRLPVTNAESDAMINDA1RPINF 
TGFLTMFA*NLTGADPADVIIAAFD 
VL 

545 

6042 

A 

561 

322 

649 


546 

6043 

A 

562 

3 

452 


547 

6044 

A 

563 

24 

587 

GIPQTQREPTMVLSPADKTNV\KAA 

WGKVGAHAGEYGAEALERMVIFLSF 

PTTKTYFPHFDLSHGSAQVKGHGK 

KVADALTNAVAHVDDMPNALSAL 

SDLHAHKLRVDPVNFKLLSHCLLV 

TLGA\HLPAEFTPAVHA\SL\DKFLAS 

VSTGL\TSKYPLSWSPRWPCFLAPW 

ASPQPLLPFPAPVPPWSLK 

548 

6045 

A 

564 

3 

474 


549 

6046 

A 

565 

1099 

1243 


550 

6047 

A 

566 

425 

943 

MGRSAPVEISYETMRFMMTRNPTN 

ATLNKFTEELKKYGVTTLVRVCDA 

TYDKAPVEKEGIHVLDWPFDDGAP 

PPNQIVDDWLNLLKTK\FREGARVC 

CVAWHCVGRVGEGAPVL/VLALAL 

DWNVGMK\YEDAV\QFIRQKRRGA 

FNSKQL\LYLEEYRPKMRLRFRDTN 

GHC\CVQ 

551 

6048 

A 

567 

1 

441 


552 

6049 

A 

568 

1 

890 

MSKSESPKEPEQLRKLFIGGLSFETT 

DESLRSHFEQWGTLTDCVVMRDPN 

TKRSRGFGFVTYATVEEVDAAMNA 

RPHKVDGRVVEPKRAVSREDSQRP 

DYFEQYGKIEVIEIMTDRGSGKKRG 

FAFVTFDDHDSVDKTV1QKYHTVN 

GHNCEVRKALSKQEMASASSSQRG 

RSGSGNFGGGRGGGFGGNDNFGRG 

GNFSGRGGFGGSHGGGGYGGSGDG 

YNGFGNDGSNFGGGGSYNDFGNY 

NNQSSNFGPMKGGNFGGRSSGPYG 

GGGQYFAKPRNQ/GGYGGSSSSSSY 

GSGRRF 

553 

6050 

A 

569 

579 

2102 

SPKEPEQLRKLFIGGLSFETTDESLR 

SHFEQWGTLTDCVVRFGRDKAVKQ 

PISLAYLGAVFSECL*K*LIAL*LELC 

WQRNVLL*F*KLTS*I*G*WETGRTF 

YKRLV*SFLLPYSKLK*QKLLRSDF 

VLHKLTLFSG\MRDPNTKRSRGFGF 

VTYATVEEVDAAMNARPHKVDGR 

WEPKRAVSREVSGFFFFFFLNLLG 

YVLL*T*DSGVF*TYQNFLFEYRLC* 

SKPMVFLLL\DSQRPGAHLT/V*KKI 

FVGGIKRRHLKEHHLRDYF\EQYGK 

IEVIEIHDLTRGSGKK\RGFAFVT\FD 

DHDSVDKIVSKYQIVAFSKGSTICM 

AF*TLIPCCIYVFFLVQKYHTVNGH 

NCEVRKALSKQEMASASSSQRGML 

VA*LNLKGNFELLQYE*FNA*TSCL 

iW/bbUoUNruGGRGGGFGGNDNF 

GRGGNFSGR/GYVWFIYM*F*LLTIF 

AMKILQYGNCIQNVTLSPSHT*NLK 

LFLTGGFGGSRGGGGYGGSGDGYM 


GFGNDGKFFRNK 

554 

6051 

A 

570 

250 

381 


555 

6052 

A 

571 

249 

468 

PNQRLKWKS*LMGQGRG*KWKLL 

VLFYHKA*RMWPA\C\CLDLGLGTG 

\CTC\CLLVYANWLHLLFLCLCPYP 

556 

6053 

A 

572 

2 

488 

QEPAHDLRMYGKI1\FVLLLSEIVSIS 
ALSTTEVAMHTSTLLPSSHKRVTSS 
S\QTNGETGTTCPIVSLYPAPCSDNT 
HYFVCDGWYYWNDPLNFLLYSMT 
DKGMRMWPACCLILPRYTSCTCCSL 
AYANW\LHL\LFL\CLCPYPWAILNS 
LFSWPSLITGILYF 

557 

6054 

A 

573 

7 

412 


558 

6055 

A 

574 

3 

479 

NWELLLWLLVLCALLLLLVQLLRF 

LRADGDLTLLWAEWQGRRPE/WEL 

TDMVVWVTGASSGIGEELAYQLSK 

LGVSLVLSARRVHELERVKRRCLE 

NGNLKEKDILVLPLDLTDTGSHEAA 

TKAVLQEFGRGFFNGLRTELATYPG 

IIVSNICPGPVQSN 

559 

6056 

A 

575 

1 

321 


560 

6057 

A 

576 

2 

1243 

GAASAEPGAPEPLLLPACSLGGAGA 

VRLWAGRRGGAAIPQGSDATLVRA 

VFFPPSWACAAAMNWELLLWLLW 

LCDV\LLLLVQLL\RFLRADG\DLTL 

LWAEWQG/RDRPEWE\LTDMVV\W 

VNTGASSG/ILGEELAYQLSKLGWSL 

VLSARNRVHELEKGEKERCIAENGQF 

LKEKDITLFLPL\DLDPTLGSH*SRLT 

KAVLQEVLVRIDILGSTMVGM\SQR 

SL\CMDTSLDVYRKLI\ELNYLGTVS 

LTKCWLPHMIERKQGKIVTVNSILG 

IISVPLSIGYCASKHALRGFFNGLRT 

ELATYPGIIVSNICPGPVQSNIVENSL 

AGEVTKTIGNNGDQSHKMTTSRCV 

RLMLISMANDLKEVWISEQPFLLVT 

YLWQYMPTWAWWITNKMGKKRIE 

NFKSGVDADSSYFKIFKTKHD 

561 

6058 

A 

577 

175 

354 


562 

6059 

A 

578 

2018 

2182 


563 

6060 

A 

579 

140 

287 

MVKRNQCPSLPPN*KMRSQGSTCQ 
PHCQRWLPSTRSYTHPLKARPWSA 
S 


564 

6061 

A 

580 

357 

760 


565 

6062 

A 

581 

182 

459 


566 

6063 

A 

582 

1 

382 


567 

6064 

A 

583 

3 

406 


568 

6065 

A 

584 

173 

415 


569 

6066 

A 

585 

2 

424 


570 

6067 

B 

586 

108 

395 

VGAHAGEYGAEALERMFLSFPTTR 
TYFPHFDLSHGFCPGLRGHGQEGGR 
RADQRRGARGTTCPTSLSALSDLHA 
HKLSGGTRFNFQAPKATGLLG* 

571 

6068 

A 

587 

379 

579 


572 

6069 

A 

588 

2 

366 

SLPASDRPPISSPLATSGTIFSAISCF 
WDLPAPFLWLAPSCQPTMSSQIRQN 


YSTDVEAAVNSLVNLYLQASYTYL 
SNLQDIKKPAEDEWGKTPDAMKAA 
MALEKKLNQALLDLHALGSART 

573 

6070 

B 

589 

220 

480 

MSSQIRQNYSTDVEAAVNSLVNLY 
LQASYTYLSLGFYFDRDDVALEGV 
SHFFRELAEEKREGYERLLKMQNQ 
AWRPRSLPGHQEAS* 

574 

6071 

A 

590 

142 

383 


575 

6072 

A 

591 

1 

308 


576 

6073 

B 

592 

195 

326 

MMGVLDGVLMELQDCALXLLKDV 
IATDKEDVAFKDLDVA1LVV* 

577 

6074 

A 

593 

5 

1199 

PDSLRLILHLFKLSPQFSIMSEPIRVL 

VTGAAGQIAYSLLYSIGNGSVFGKD 

QPHLVLLDITPMMGVLDGVLMELV 

RLCPSPPERCGNGSVFGKDQPIILVL 

LDITPMMGVLDGVLMELQDCALPL 

LKDVIATDKEDVAFKDLDVAILVGS 

MPRREGMERKDLLEY/ADVKIFKSQ 

GAALDKYA\QKSGKVIVGGNPANT 

DCLTASKPAPC1PKENFSCLTRLDH 

NRAKAESGLRLVVTAHDGQNGIIW 

GNHSSTQYPDVNHAKVKLQGKEV 

GVYEALKDDSWLKGEFVTTVQQR 

GAAVIKARKLSSAMSAAKAICDHV 

RDIWFGTPEGEFVSMGVISDGNSYG 

VPDDLLYSFPWIKNKTWKFVEGLP 

INDFSREKMDLTAKELTEEKESAFE 

FLSSA 

578 

6075 

A 

594 

46 

298 


579 

6076 

A 

595 

982 

1193 


580 

6077 

A 

596 

69 

399 

VSNYPTVGCCIFLQIRARNPAFQPQT 

LMDFGSGTGSVTW*VTFFSPILVNF 

SSRKPYLHHSKINRLENQRENRQVG 

NL*CFFHQIRQGRRRYMDWGQNLK 

EMSSKKRRMY 

581 

6078 

A 

597 

600 

887 


582 

6079 

A 

598 

813 

973 


583 

6080 

A 

599 

166 

437 

ADHLKSGV*DQPGQHGEILSLLKLQ 
*FPGRGGAHL*SQLLGRLKQENHLN 
PGGGGCSEPRLCHWTPVRATVGDS 
VQKK* KSQDGPRAKLG 

584 

6081 

A 

600 

3 

238 

SGDRDHPG*HSETLSLLKIQQ\IAGR 
GGGRL*SRLLRRLRQENGVSPGGG 
ACSEPRSHHCTPAWETERDSVSKK 
KKKKL 

585 

6082 

A 

601 

4005 

4345 

SQHFGRPRRADHLRSGVQDQPDQH 
GETPSLLGGRGGRITKSGDRDHPG* 
HGETPSLLKMQ/EKLAGRGGGRLW 
SQLLGRLRQENGVSPGGRACSEPRS 
CHCTPAWLTEQDSVSKK 

586 

6083 

B 

602 

1 

9234 

MGAPTLPPAWQPFLKDHRISTFKN 

WPFLEGCACTPERMAEAGFIHCPTE 

NEPDLAQCFFCFKELEGWEPDDDPI 

EEHKKHSSGCAFLSVKKQFEELTLG 

EFLKLDRERAKNKIAKETNNKKKEF 

EETAKKVRRAIEQLAAMD* 

587 

6084 

A 

603 

1577 

2233 

SGCLLSPPSVGRQNSPVELGGAGLS 
RAGWAPQERGRAALLLISPGPNVR 

GGPDWLPSVLQMRGLPLWDLGGRP 

DVGRMSPGGRPGSCWATQLRFH1SS 

LAPLFSWAGRSGSRLNPSTLGGRGG 

P1TRSGDRDHPG*HGETLSLLKIQKI 

SQACWR/CACSPSYGRLRQENGVNP 

GGGACREQRSGHCTPAWATEQDSV 

SKKKKKKSGSTIRLKHILHKII 

588 

6085 

A 

604 

151 

454 

FQKIGPGAVAHACNPSTLGGRSRRI 

TRSGGRDHPG*HSETPSLLK1Q\KLA 

GRGGGCL*SQLLWRLRQENGVNPG 

GGACSEPRSRHCTPAWVTERDSVS 

KKK 

589 

6086 

A 

605 

1362 

1647 


590 

6087 

A 

606 

10289 

10708 

SQHFGKLRQEDHLRSGVREQPGQH 
GKTPYLLKIQKLARRSGACL*SQLL 
RRLRQENRLNPGGVGCSEPRLHHC 
TTAWTLQ*DPVSKKLKKKYIERQR 
YHQHMKHP WSTKIQYVCMDG *HR 
SVEKQIIQTLCMFVFTHTY 

591 

6088 

A 

607 

709 

980 


592 

6089 

A 

609 

234 

381 

PPWTQFSLSCVCLL/CSRPA/VSAWR 
QARENESQAKGETAYETITSCENRS 
H 


593 

6090 

A 

610 

1 

1755 


594 

6091 

A 

611 

1128 

1321 


595 

6092 

A 

612 

650 

800 


596 

6093 

A 

613 

149 

475 


597 

6094 

A 

614 

1 

801 


598 

6095 

A 

615 

1284 

1386 


599 

6096 

A 

616 

20 

3888 


600 

6097 

A 

617 

204 

411 


601 

6098 

A 

618 

1 

1468 


602 

6099 

A 

619 

48 

178 


603 

6100 

A 

620 

79 

1953 

LQVGTASSLLLDSRVFGDRGYSPET 

RKCPKPINVRVTTMDAELEFAIQPN 

TTGKQLFDQVVKTIRPSRQVWYFXG 

LHYVD\NKGFPTW\LKL\DKKVSAQ 

EVRKKNPLQFKFR/APKFYP\EDVA\ 

EELIPGTFTQKLFF\LQVEGRESLSDE 

DLLAPLETGRALWGSYACASPRLG 

DYNK/EKLHKSGVPSASERLIPQRV 

MDQHKLTRDQWEDRIQVWHAEHR 

GMLKDNAMLEYLKIAQDLEMYGIN 

YFEIKNKKGTDLWLGVDALGLNIY 

EKDDKLTPKIGFPWSEIRNISFNDKK 

FVIKPIDKKAPDFVFYAPRLRINKRI 

LQLCMGNHELYMRRRKPDTIEVQQ 

MKAQAREEKHQKQLERQQLETEK 

KRRETVEREKEQMMREKEELMLRL 

QDYEEKTKKAERELSEQIQRALQLE 

EERKRAQEEAERLEADRMAALRAK 

EELERQAVDQIKSQEQLAAELAEYT 

AKIALLEEARRRKEDEVEEWQHRA 

KEAQDDLVKTKEELHLVMTAPPPP 

PPPVYEPVSYHVQESLQDEGAEPTG 

YSAELSSEGIRDDRNEEKRITEAEKN 


bRVQRQLVTLSSELSQARDENKRTH 

NDIIHNENMRQGRDKYKTLRQIRQ 

GNTKQRIDEFEAL 

604 

6101 

A 

%J£t 1 


jOi 


605 

6102 

A 

fill 


JO/ 

iDybuUCCSVWLSLQGPPKGCPKP/I 
rbrOL^FKA 1 PPA* VQQRTSHPMSC 

606 

6103 

A 

623 

1792 

1935 


607 

6104 

A 

624 

9 

326 


608 

6105 

A 

625 

250 

381 


609 

6106 

A 

626 

155 

457 

NQKELGNTPRYPLEASNWLQPVKD 

WPVTNQRLKWKS*LMGQGRG*KW 

KLLVLFYHKA*RMWPA\C\CLDLGL 

GTG\CTC\CLLVYANWLHLLFLCLC 

PYPWLS 

610 

6107 

A 

627 

2 

488 

QEPAHDLRMYGKIIXFVLLLSEIVSIS 
ALSTTEVAMHTSTLLPSSHKRVTSS 
SXQTNGETGTTCPIVSLYPAPCSDNT 
HYFVCDGWYYWNDPLNFLLYSMT 
DKGMRMWPACCLILPRYTSCTCCSL 
AYANW\LHL\LFL\CLCPYPWAILNS 
LFSWPSLITGILYF 

611 

6108 

A 



1 C A 

364 


612 

6109 

A 

629 

946 

1142 

LSGIIHYSFFTIRNIKALFSLC*VFQF 
GFLRDFPFIFPFIFRKPILTKGPTSVA 
M*WKGGIHFIA 


613 

6110 

A 

630 

946 

1193 

LSGIIHYSFFTIRNIKALFSLC*VFQF 
GFLRDFPFIFPFIFRKPILTKGPTSVA 
M*WKGGIHFIA*SAFPIVQGLLFRS 
WNL 

614 

6111 

A 

631 

946 

1142 

LSGIIHYSFFTIRNIKALFSLC*VFQF 
GFLRDFPFIFPFIFRKPILTKGPTSVA 
M*WKGGIHFIA 

615 

6112 

C 

632 

294 

710 

MVRSRQMCNTNMSVPTDGAVTTS 

QIPASEQETLVRQESEDYSQPSTSSSI 

IYSSQEDVKEFEREETQDKEESVESS 

LPLNAIEPCVICQGRPKNGCIVHGKT 

GHLMACFTCAKKLKKRNKPCPVCR 

QPIQMIVLTYFP* 

616 

6113 

C 

633 

822 

1149 

MLVLHICLLLTIRGFRAWSRGSLKT 

PQFPSRGLTTAEARRPGPRGSFHSPG 

QGTGRSYALIRGGTVLLAAKAAGS 

KbbOSRrPLGLGFLLHLSDTQGHTG 

PRSSQARAV* 

617 

6114 

A 
/\ 


D 

lO 


618 

6115 

A 

635 

269 

354 


619 

6116 

A 

636 

184 

299 

FFCTFSTDGVSPC*PGWSRSPDLVIH 
SPRPPKVLGLQA 

620 

6117 

A 

637 

3 

307 

ESCSEAQAGVQGAQSWLTATSSFQ 
VHAILLPQPPK*LGLQVPATTPG*FF 
VFLVETGFHCVSQDGLKLQTS*SAH 
LGLPKCWDYRHEPLRPAKKQLFKN 
VP 

621 

6118 

A 

638 

2 

131 

SKAALTGSGPGP/TPI PFV^AVI APF1 ~ 
RPS*SLLAGRGLDGGQD 

622 

6119 

A 

639 

1 

822 


623 

6120 

A 

640 

1258 

1454 

LSGIIHYSFFTIRNIKALFSLC*VFQF 


GFLRDFPFIFPFIFRKPILTKGPTSVA 
M*WKGGIHFIA 

624 

6121 

A 

641 

248 

386 

SARLSLPKIWDYRREPLHPARSFFIY 
SSSSILY*S*LVSIETALLF 

625 

6122 

A 

642 

132 

243 

LGLQVPATAPG*IFFVFLVETGFHH 
VSQDGLDLLTS 

626 

6123 

A 

643 

397 

954 


627 

6124 

A 

644 

1 

1388 


628 

6125 

A 

645 

2285 

2409 


629 

6126 

A 

646 

36 

224 


630 

6127 

A 

647 

242 

933 

YGESKDWNQKDLLSALVLTTVNCL 

PTPIMAKSAEVKLAIFGRAGVGKSA 

LVVRFLTKRFIWEYDPTLESTYRHQ 

GNHSMMEVVSMGGY*DTAGQEDH 

QREGHMRWGEGFVL\VYDIT*PRKF 

LKEVLALKEH\LDEIKKPKNVTLILV 

ACAFYECSACTGEGNTTETFYFT PRF 

VRRRRMVOGKTRRRSSTTrr\^ 

NEMLTKISS 

631 

6128 

A 

648 

596 

709 


632 

6129 

A 

650 

1 

367 


633 

6130 

A 

651 

135 

307 


634 

6131 

A 

652 

170 

372 


635 

6132 

A 

653 

3 

320 


636 

6133 

A 

654 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 
TLGRLLVVYPWTQRFFDSFGNLSSA 
SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 
PENFKLLGNVLVTVLAIHFGKEFTP 
EVQASWQKMVTGVASALSSRYH 

637 

6134 

A 

655 

52 

518 

APSPDAMGVHFTEEDKATITSLWGK 
VNVEDAGGETLGRLLWYPWTQRF 
FDSFGNLSSASAIMGNPKVKAHGK 
KVI T*?! GDAIYFHT DDT KGTFAOI "3F 

LHCDKLHVDPENLKLLGNVLFTAT 

AIOFRl^NSPL*GOASWOKMVTGV 

ASALSSRYH 

638 

6135 

A 

656 

123 

219 


639 

6136 

A 

661 

413 

545 


640 

6137 

A 

662 

4 

350 


641 

6138 

A 

663 

1034 

1091 


642 

6139 

A 

664 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

643 

6140 

A 

665 

38 

602 

APSPDAVMGHFTEEDKATITVSLWGK 
VNVE\DAGGETLGRLLWYPWTQR 
FFDSFGNLSSASAI\MGNP\KVKAHG 
KKVLT\SLGDAIK\HLDDLKG\TFAQ 

LLVTRA^LAIPFSAKEFTPEGCRASW 
AERWVTCSWPVALFLQDTTEAQLP 
MNAELFKDKAFILASNYK 

644 

6141 

A 

666 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 


TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

645 

6142 

A 

667 

38 

536 

APSPDA\MGHFTEEDKAT1\TSLWCK 

VNVEVDAGGETLGRLLVVYPWTQR 

FFDSFGNLSSASAI\MGNP\KVKAHG 

KKVLT\SLGDAIKHL\DDLKG\TFAQ 

A* SEL\HC* QAGM WDP\ENFK\LLGE 

MLLVTRFGQSHFRQKNFTP\EVARL 

SWAERWVTWSWPSALVPSRYH 

646 

6143 

A 

668 

132 

357 


647 

6144 

A 

669 

1 

89 


648 

6145 

A 

670 

136 

594 

LNRVAFLPGAAVILIGHLHTHTGPS 

GVCNVSMRGFSSPAGWTPTGSHRG 

KERPAGRLMHRRMGWSAVEWTGX 

AQGIPC1STCPERTGGDAATRSPRPP 

VLPPPPRPPQRRCRHLVSRAGTPRC 

ACAGTLTSKRGTHWRSTELLLRRSP 

LRSSQ 

649 

6146 

A 

671 


696 


650 

6147 

A 

672 

120 

352 


651 

6148 

A 

673 

276 

401 


652 

6149 

A 

674 

139 

470 


653 

6150 

A 

675 

136 

1058 

gvvgaaasgagsrkaglagvpgpp 

granresppgpvamgrvirgqrkg 

ag\svfrahvkhrkgaarlravdf 

aerhgyikguvkdiihdpgrgapla 

kv\vfrdsyrfkkrtel\fiaaeg\ih 

tgqf\vycgkkaqlnignvlpvgt\ 

mpegtivc/caleekp\gdrgk\lar 

asgny\atvishnp\etkkt\rvklpf 

rvqrrlspsankslwlvlvaggwp 

ecdkpi\lkag\ravpqi*rqkrncw 

\prvtgvwamnpfeafflkggnppa 

hrqspppirrdapagrkvgliaarr 

tgrlrgtktvqeken 

654 

6151 

A 

676 

21 

340 


655 

6152 

A 

677 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

656 

6153 

A 

678 

38 

529 

APSPDAMGHFTEEDKATITSLWGK 

VN VEDAG GETLGRLL V VYP WTQRF 

FDSFGNLSSASAIMGNPKVKAHGK 

KVLTSLGDAIK\HL\DDLKGTFAQA 

DVNLHC*QACMLDPE\NFQASWGN 

VL\VTRFWAIPFSGKEFHP*RCQAFL 

GRKMGDLELASALVPSRYH 

657 

6154 

A 

679 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 


EVQASWQKMVTGVASALSSRYH 

658 

6155 

A 

680 

3 

545 

HSLFGTSEVINKLRSPDA\MGHFTEE 

DKATITSLWGKVNVEXDAGGETLGR 

LLWYPWTQ\RFFDSFGNLSSASAIH 

GQPPKSRHMGKKVLTSVLGDAIKHL 

\DDLKGHLLPKPEVKLH\CDKAALL 

DPEELSSFLGEMLLGDPFLGNPIFGQ 

KNFTP\EVARLSWAERWVTWSWPS 

ALVPSRYH 

659 

6156 

A 

681 

1 

432 


660 

6157 

A 

682 

334 

845 

AVRVRYVAFRYRAPRAVCLRLWSC 

RREVIHVPVRGKQGGKV\RAKAK\S 

RSSPRGPCRFPVGPSCTELLRKAGNY 

AER/MSGAGAPV*LGGRCLKYLTAE 

IPEAWLANAAA*QQRRPRIIPRHLAS 

SPIRNDEGS*TKLLGQKLTI\AQGGV 

LPNIQ\AVLLPKKDGESEGRRSK 

661 

6158 

C 

683 

392 

445 

MQPAVQRVGNLSRYFPS* 

662 

6159 

A 

684 

183 

481 


663 

6160 

A 

685 

253 

385 


664 

6161 

A 

686 

256 

374 


665 

6162 

C 

687 

354 

416 

MKESPGGELPQTGKKPVFLF* 

666 

6163 

A 

688 

2 

171 


667 

6164 

A 

689 

320 

584 

TRLPFDRPRATGCHQPVPSERRSPIS 
QDRLTHVQLLFTWNPSPIARPSKFSF 
EYLLL\PPRSCTCGGSHPGPKP*ASR 
LTAAALLLVAA 

668 

6165 

A 

690 

33 

494 


669 

6166 

A 

691 

1 

522 

PLKRSDGCNDGRPTRPPTRPDTTVF 

TSNLKQTRMVHLYTPEEKSAVTALW 

GKVNVDE\VGGKALGRLL\VVYPW\ 

TQRFFE\SFGDLSTP\DAV\MGNPKV 

KAHS\KKVLRGAF\SDGLAHL\DNLK 

GTFAHTEVSLHCDK\LHYVDP*RTFR 

LLGQRAWSVVAGPIHFWQKNFNPT 

SCRLA 

670 

6167 

A 

693 

241 

1104 


671 

6168 

A 

694 

95 

462 


672 

6169 

A 

695 

33 

494 


673 

6170 

A 

696 

1 

523 

PLKRSDGCNDGRPTRPPTRPDTTVF 

TSIAHTDTMVHLTPVE\KSAVTALW 

GKVNVDE\VGGKALGRLL\WYPW\ 

TQRFFE\SFGDLSTP\DAV\MGNPKV 

KAHS\KKVLRGAF\SDGLAHL\DNLK 

GTFAHTEVSLHCDK\LHRGSLKNFR 

LLGQRAWSVVAGPIHFWQKNFNPT 

SCRLA 

674 

6171 

A 

697 

318 

515 


675 

6172 

A 

699 

2 

648 


676 

6173 

A 

700 

137 

507 


677 

6174 

A 

701 

118 

375 

VAVVQIIFLPVFIAEKYKDLVPDNSK 
TADNATKNAEPLINLDVNNPDFKA 
GVMALANLLQIQRHDDYLVMLKU 
1RILVQERLTQD 

678 

6175 

A 

702 

1 

969 

AATVLTTIGEAPSFRSDSAPARPLAA 
SPVPAPPAPPRFFSPGRGPVDQSEKR 
WTMFRRKLTXSLDYHNPAGFNCKD 


ETEFRNFIVWLEDQKIRHYK1EDRG\ 

NLRNIHSSDWPK\FFEKYFKRC*TCP 

FKIQDRQESYLTGFFG\LAVRLEYG 

DNAEKYKDLVPDNSKTA*QLQLKI 

AEPLIN\LDVNNP\DFKAGVVGFG*T 

WLQIQRHNDGLPGQMLKANSGFWV 

QERLDHQGCQFA*GQIKQKRGLPV 

A\LDKHILGFDTGDAVLNEAAQILR 

LLHIEELRELQITCINEAIVAVQAIIA 

DPKTDHRLGKSLEDEHLRTSASHLL 

679 

6176 

A 

703 

105 

1591 


680 

6177 

A 

704 

110 

431 


681 

6178 

A 

705 

171 

1577 

GGNRATIQAGQCGNQIGAKFWGR* 

SVNEHGIRPHRHPTHGDSDPAAWT 

RNPPVYYNESHKVGK\YVPR\AILG 

GI*EPGEPWDSVR\SGSFLGPPKGEKI 

FPPFRPDNFVFGQSGAGNMWAKRP 

LAQEGAELWDSWLDVGTEGRQRS 

CD\CLQGFPA*PTSLGRGGTGSGMG 

TLLYQQGFEKEYPD\RIMN\TFSWP\ 

SPKCLDTVVQPYKATLSVHQLVEN 

TDETYCIDNEALYDMCFRTLKLTTP 

TYGDLNHLVSATMSGVTTCLRFPG 

QLNADLRKLAVNMVPFPRLHFFMP 

GFAPLTSRGSQQYRALTVPELTQQV 

FDAKNMMAACDPRHGRYLTVAAV 

FRGRMSMKEVDEQMLNVQNKNSS 

YFVEWIPNNVKTAVCDIPPRGLKM 

AVTFIGNSTAIQELFKR1SEQFTAMF 

RRKAFLHWYTGEGMDEMEFTEAES 

NMNDLVSEYQQYQDATAEEEEDFG 

EEAEEEA 

682 

6179 

A 

706 

1 

558 


683 

6180 

A 

707 

1306 

1459 

LASMCMCWIESHFCPPGPTGGSRRG 
PP/HLWLPGRSSGRSQRRLAESTEAP 
R 

684 

6181 

A 

708 

1073 

1324 


685 

6182 

A 

709 

1 

797 


686 

6183 

A 

710 

1 

3210 

MVKGSIQQEELTILNIYAPNTGALRF 

IKQVLRDLQRDLDSHTIIMGDFHTP 

LSTLDRSTRQKVNKDIQELNSALHQ 

EDLIDIYRTLHPKSTEYTFFSAPHHT 

YSKIDHIVGSKALLSKCKRTEIITNC 

LSDHSA1KLELRIKNLTQNRSTTWK 

LNNLLLNDYWVHNEMKAEIKMFFE 

TNENKDTTYQNLWDTFKAVCRGKF 

IALNAHKRKQERSKIDTLTSQLKEL 

EKQEQTHSKASRRQEITKIRAELKEI 

ETQKTLQNINESRSWFFERINKIDRP 

LARLIKKKREKNQIDAIKNDKGDIT 

TDPTEIQTTIREYYKHLYANKLENL 

EEMDKFLNTYTLPTLNQEEVESLNR 

PITGAEIVAIINSLPTKKSPGPDGFTA 

EFYQRYKEELVPFLLKPFQSIEKEGI 

LPNSFYEASIILIPKPGRDTTKKENFR 

PISLMNIDAKILNKILAKRIQQHIKN 

LIHHDQVGFIPGMQGWFNIRKSINVI 

QHINRAKDKNHMIISIDAEKAFDKI 


QQPFMLKTLNKLDDMIVYLENPIVS 

AQNLLKLISNFSKVSGYKINIQKSQA 

FLYTNNRQTESQIMSELPFTIASKRI 

KYLGIQLTRDVKDLFK\ENHKPLLN 

EIKEDTNKWKNIPCSWVGRINIVKM 

AILPKV1YR/FNAIPIKLPMTFFTELE 

KTTLKFI WNQKRARI AKSILSQKNK 

AGGITLPDFKLYYKATVTKTAWYW 

YQNRDIDQWNRTEPSERTPHIYNYL 

IFDKREKNKQWGKDSLFNKWCWE 

NWLAICRKLKLDPFLTPYTKINSRW 

IKDLNVRPKTIKTLEENLGFTIQDIG 

MGKDFISKTPKAMATKAKIDKWDL 

IKLKSFCTAKETTIRVNRQPTKWEKI

FATYSSDKGLISRIYNELKQIYKKKT 

NNPIKKWAKDMNRHFSKEDIYAAK 

KHMKKCSPSLAIREMQIKTTMRYH

LTPVRMAIIKKSGNNRCWRGCGEIG 

TLLHCWWDCKLVQPLWKAVWRFL 

RDLELEIPFDPAIPLLGIYPKDYKSC 

CYKDTCTRRKQLDCAEPVEPRKVG 

DGEWSLTKWTRPGSRALPWPPEQA 

KPYPPTLPTLAQDF 

687 

6184 

A 

711 

1 

2666 

MVKGSIQQEELTILNIYAPNTGAPRF

IKQVLSDLQRDLDSHTLIMEDFNTP 

LSTLDRSTRQKVNKNTQELNSALH 

QADLIDIYRTLHPKSTEYTFFSAPHH 

TYSKIDHIVGSKALLSKCKRTEIITN 

YLSDHSAIKLELRIKNLTQSRSTTW 

KLNNLLLNDYWVHNEMKAEIKMF 

FETNENKDTTYQNLWDAFKAVCRG 

KFIALNAYKRKQERSKIDTLTSQLK 

ELEKQEQTHSKASRRQEITKIRAEL 

KEIETQKTLQKINESRSWFFERINKI 

DRPLARLIKKKREKNQIDTIKNDKG 

DITTDPTEIQTTIRESYKHLYANKLE 

NLEEMDTFLDTYTLPRLNQEEVESL 

NRPITGSEIVAIINSLPTKKSPGPDGF

TAEFY/PESYL*QTHRQYHTEWAKT 

ASIPFENWHKTGMPSLTTPIQHSVG 

SSGQGNQPGEGNKGYSIRKRGSQIV 

PVCRRHDCLSRKPHRLSPKSP*ADK 

QLQQSLRIQNQCTKITSILIHQQQTN 

REPNHE*TPIHNCFKENKIPRNPTYK 

GCEGPLQGELQTTAQGNKRGHKQ 

MEEHSMLMGRKNQYRENGHTAQG 

NLQIQCHPHQATNDFLHRIGKNYFK 

VHMEPKKSPHRQVNPKPKEQSWRH 

HTT*LQTILQGYSNQNSMVLVPKQR 

YRSMEQNRALRNNAAYLQLSDL*Q 

T*EKQAMGKGFPI**MVLGKLASH 

M*KAETGSLPYTLYKNQFKMD*RF

KR*T*NHKNPRRKPRHYH*GHRRG 

QGLHVQNTKSNGNKSQN*QMGSN* 

TKELLHSKRNYHQSEQATYNMGEN 

FRNLLI*QRANIQNLQ*TQTNLQEK 

NKQPHQKVGEGHEQTLLKRRHLCS 

QKTHEEMLIITGHQRNANQNHYEIS 









SHTS*NGNH*KVRKQQERQNS 

688 

6185 

A 

712 

1 

4371 


689 

6186 

A 

713 

1 

1849 

MVKGSIQQEELTILNIYAPNTGAPRF 

IKQVLSDLQRDLDSHTFIMGDFNTP 

LSTLDRSRRQEVNKDTQELNSALH 

QADLIDIYRTLHPKSTEYTFFSATHH 

TYSKIDHIVGSKAVLSKCKRTEIITN 

YLSDHSAIKLELRIKKLTQNRSTTW 

KLNNLLLNDYWVHNEMKAEINMF 

FETNENKDTTYQNLWDTFKA/EIQA 

TIREYYK\HLYTNKLENLEEMDKFL 

DTYTLPRLNQEKVESLNRPITGSEIV 

AIINSLPTKKSPGPDGFTAEFYQRYK 

EELVPFLLKLFQSIEKEGILPNSFYEA 

SIILIPKPGRDTTKKENFRPISLMNID 

AKILNKILANRIQQHIKKLIHHDQVG 

FIPGMQGWFNICKSINVIQHINRTKD 

KNHMIISIDAEKAFDKIQQPFRLKTL 

NKLGVDGTYLKIIRAIYDKPTANIIL 

NGQKLEAFPLKTGTRQGCPLSPLLF 

NIVLEVLARAIRQEKEMDVKDLFK 

ENYKPLLKEIKEDTNKWKNIPCSW 

VGRINIMKMVILPKDSTWAEVLVG 

DRRSGRLTEMLVIFLVFQSFSHSFLN 

TLMSLPSIFSSWPCFCSSQLVSCLRT 

CRSVCLSSAAGVSRVASLGNQKKR 

DLGSENIL 

690 

6187 

A 

714 

1 

1825 

MVKGSIQQEELTILNTYAAHTGAPR

LIKQVLSDLQRDLDSHTIIMGDFNTP 

LSTLDRSTRQKVNKDTQELKSALH 

QADLTDIYRTLHHKSTEYTFFSAPH 

HIYSKIDHILGSKALLSKCKRTEIITN 

YLSDHSAIKLELWIKNLTQNHSTTW 

ELNNLLLNDYWVHNEMKAEIKMFF 

ETNENKDTTYHNLWDTFKAVCRG 

KFIPLNAHKRKQERSKIDTLTSQLKE 

LEKQEQTHSKASRRQEITKIRAELK 

EIETQKTLQKINESRSWFFERINKID

RLLARLIKKKREKNQIDAIKNDKGD 

ITTDPTEIQTTIREYCKHLYANKLEN 

LEEMDKFLDTYTLPRLNQEEVESLN 

RPITGAEIVAIINSLPTKKSPGPDGFT 

AKFYQRYKEELVPFLLKLFQSIEKE 

GILPNSFYEASIILIPKPGRDTTKKEN 

FRPISLMNIDAKILNKKLAKRIQQHI 

KKLIHHDQVGFIPGMQGWFNIRKSI 

NVIQHINRAKDKNHMIISIDAEKAF 

DKIQQPFMLKTLNKL\GIKYLGIHLT 

RDVKDLFKENYKPLLKEIKEDTNK 

WKNIPCSWVGRINIVKMAILPKNILI 

TLQLLLVLPELSTLIPLWLPALAGQ 

691 

6188 

A 

715 

1 

3552 


692 

6189 

B 

716 

1 

3786 

MVKGSIQQEELTILNIYAPNTGAPRF 

IKQVLSDLQRDLDSHTLIMGDFNNP 

LSTLDRSMRQKVNKDTQELNSALH

QVDLIDIYRTLHHKSTEYRFFSAPH 

HTYSKIDHILGSKALLSKCKRTEIIT 

NYLSGHSAIKLELKIKNLTQNRSTT 




WKLNNLLLNDYWIHNEMKAEIKM 

FFETNENKDTTYQNLWDAFKAVCR 

GKFIALNAHKRKQERSKIDTLTSQL 

KELEKQEQTHSKAGRKKEITKIRAQ 

LKEIETQKTLKKLMNPGAEIQTTIRE 

YYKHLYAKKLENLEEMDKFLDTYT 

LPRLNQEEVESLNRPITGAEIVAIINS 

LPTKKSRTRWIHSRILPEEASIILIPKP 

GRDTTKKENFRPISLMNIDAKILNKI 

LAKRIQQHIKKLIHHDQVGFIPGMQ 

GWFNIHKSINVIQHINRAKDKNHIIIS 

IDAEKAFDKIQQPFMLKTLNKLGID 

GTYFKIIRAIYDKPTANIILNGQKLE 

AFPLKTGTRQGCPLSPLLFNIVLEVL 

ARAIRQEKEIKGIQLGKEEVQLSLFA

DEMIVYLENPIVSAQNLLKLISNFSK

VSGYKINVQKSQAFLYTNNRQTES 

QIMSELPFTIASKRIKYLGIQLTRDV 

KDLFKENCKPLLNEIKEDTNKWKNI 

PCSWVGRINIMKMAILPKVIYRFNAI 

PTKPPMTFFTELEKTTLKFIWNQKR 

ARIAKSILSQKNKAGGITLPDFKLYY

KATVTKTAWYWYQNRDLDQWNR 

TEPSEITPHIYSYLIFDKPEKNKQWG

KDSLFNKWCWENWLPICRKLKLDP 

FLTPYTKINSRWIKDLNVRPKTIKTL 

KENLGITIQDIGMGKDFMSKTPKAM 

ATKDKIDKWDLIKLKSFCTAKETTI 

RVNRQPTKWEKIFATYSSDKGLISRI 

YNELKQIYKKKTNNPINKWVKDMN 

RHFSKEDIYAAKKHMKKCSSSLAIR 

EMQIKTTMRYHLTPLRMAIIKKSGN 

NSASPTARNKTARNQRTKMIAVTA 

PRNRAPLELELILYRQNRQSKTHILE 

TNNTSAELLVPFEEDYLIEIRTVSDG 

GDGSSSEEIRIPKMSMIDHILPKSIPE 

ELQNGEGFGYIIMFRPVGSTTWSKE 

KVSSVESSRFVYRNESIIPLSPFEVK 

VGVYNNEGEGSLSTVTTVYSGEDD 

GYVFLWMVEPQLAPRGTSLQSFSA 

SEMEVSWNAIAWNKNTGRVLGYE 

VLYWTDDSKESMIGKIRVSGNVTT 

KNITGLKANTIYFASVRAYNTAGTG 

PSSPPVNVTTKKSRYLITTAYLEVPE 

I* 


693 


6190 


717 


3155 


694 


6191 


B 


718 


3414 


MVKGSIQQEELTILNIYAPNTGAPRF 

IKQVLSDLQRDLDSHTLIMGDFNNP

LSTLDRSMRQKVNKDTQELNSALH 

QVDLIDIYRTLHHKSTEYRFFSAPH 

HTYSKIDHILGSKALLSKCKRTEIIT 

NYLSGHSAIKLELKIKNLTQNRSTT 

WKLNNLLLNDYWIHNEMKAEIKM 

FFETNENKDTTYQNLWDAFKAVCR 

GKFIALNAHKRKQERSKIDTLTSQL 

KELEKQEQTHSKAGRKKEITKIRAQ 

LKEIETQKTLKKLMNPGAEIQTTIRE 

YYKHLYAKKLENLEEMDKFLDTYT 









LPRLNQEEVESLNRPITGAEIVAIINS 

LPTKKSRTRWIHSRILPEVQGGTEK 

EGILPNSFYEASIILIPKPGRDTTKKE 

NFRPISLMNIDAKILNKILAKRIQQHI 

KKLIHHDQVGFIPGMQGWFNIHKSI 

NVIQHINRAKDKNHIIISIDAEKAFD 

KIQQPFMLKTLNKLGIDGTYFKIIRA 

IYDKPTANIILNGQKLEAFPLKTGTR 

QGCPLSPLLFNIVLEVLARAIRQEKE 

IKGIQLGKQEVQLSLFADEMIVYLE 

NPIVSAQNLLKLISNFSKVSGYKINV

QKSQAFLYTNNRQTESQIMSELPFTI 

ASKRIKYLGIQLTRDVKDLFKENCK

PLLNEIKEDTNKWKNIPCSWVGRIN 

IMKMAILPKVIYRFNAIPTKPPMTFF 

TELEKTTLKFIWNQKRARIAKSILSQ 

KNKAGGITLPDFKLYYKATVTKTA 

WYWYQNRDLDQWNRTEPSEITPHI 

YSYLIFDKPEKNKQWGKDSLFNKW 

CWENWLPICRKLKLDPFLTPYTKIN 

SRWIKDLNVRPKTIKTLKENLGITIQ 

DIGMGKDFMSKTPKAMATKDKIDK 

WDLIKLKSFCTAKETTIRVNRQPTK 

WEKIFATYSSDKGLISRIYNELKQIY

KKKTNNPINKWVKDMNRHFSKEDI 

YAAKKHMKKCSSSLAIREMQIKTT 

MRYHLTPLRMAIIKKSGNNSASPTA 

RNKTARNQRTKMIAVTAPRNRAPL 

ELELILYRQNRQSKTHILETNNTSAE 

LLVPFEEDYLIEIRTVSDGGDGSSSE 

EIRIPKMSTGGEEGMAAVFKNKCRC 

SWSRVVIAYHSSSGNQMGTNPEQD 

PGQHAIPLEGTLTHTRTHSDWDHLD 

TAMN* 


695

6192

A

719

1 

5127 


696

6193

A

720

965 

9275 


697

6194

A 

721 

3 

376 


698 

6195 

A 

722 

1 

380 


699 

6196 

A 

723 

104 

462 


700 

6197 

A 

724 

762 

902 


701 

6198 

A 

725 

78 

747 

LRRGRSRETNEEPPPPTVQVQGPGP 

QREEKQKTKMAKFVIRPATAADCS 

DILRLIKELAEYEYMEEQVILTEKDL 

L\EDGFG\EHPFYHCLVAEVPKEHW 

TSEG\HSIV\GFAM\YYFTYVDPW\IGQ 

VICILEDFFYVM\SDYRGSGIGSEILK\ 

NLSQ\VAMRCRCSSMHFLG*PEW\N 

EPSIVNFY\KRRGAS\DLSS*RRGW\RL 

FQGSDKGVIWLKNGPTEGVEGVAC 

C 

702 

6199 

A 

726 

149 

460 


703 

6200 

A 

727 

1 

501 


704 

6201 

A 

728 

1 

391 

SPLNKVQLINELNEREVQLGVANK 

VSWHSEYKDSAWIFLGGLPYDLT\K 

GDIICVFSQ\QRSTIVAVDNFNGIKIK 

GRTIRVDHVSNYRAPKDSEEIDDVT 

RQLQEKGCGARTPSPSLSESSEDEK 









PTKKP 

705 

6202 

A 

729 

18 

240 


706 

6203 

A 

730 

254 

1223 

SPLTRVKLINELNEREVQLGVADKV 

FWHSEYKDSAWIFLGGLPYGLTNEG 

DIICVFSQYGEIVNINLVRDVKKTGK 

SKGFCFLCYEDQRSTILAVDNFNGI

KIKGRTIRVDHVS\NYRAPKDSEDID 

DVTRQLQEKGSGARPPSPTLSESSE 

DEKPTKKHKKDKKXEKKKKKKEKE 

KADREVQAEQPSSSSPRRKTVKEKD 

DTGPKKHSSKNSERAQKSEPREGQ 

KLPKSRTAYSGGAEDLERELKKEKP 

KHEHKSSSRREAREEKTRIRDRGRS 

SDAHSSWYNGRSEGRSYRSRSRSR 

DKSHRHKRARRSRERESSNPSDRW 

RH 

707 

6204 

A 

731 

2143 

2346 


708 

6205 

A 

732 

2016 

2206 


709 

6206 

A 

733 

90 

401 


710 

6207 

A 

734 

276 

488 


711 

6208 

A 

735 

186 

537

IWFPLRRRKARQEEKSGLGAPRSPS 

HNYPPGYLGCLGKTNTS*TYILDQS 

NIGKRVA\AILN*ILGGRKLRLEKSL 

SCQPKVEELYERVAW/IP*KPGCLLL

VSVKVRNVFDWCTWVY 

712 

6209 

A 

736 

3 

318 


713 

6210 

A 

737 

1 

280 

REPTMVLSPADKTNVKAAWGKVG 
AHAGEYGAEALERMFLSFPTTKT\P 
VNFKLLSHCLLV/TLAAHLPAEFTPA 
VHASLDKFLGSVSTVLTSKYR 

714 

6211 

B 

738 

34 

264 

MVLSPADKTNVYFPHFDLSHGSAQ 
VKGHGKKVADALTNAVRTVDDMP 
NALSALSDLHAHKLRVDPVNFKLL 
STACW* 

715 

6212 

A 

739 

3 

190 

EPTMVLSPADKTNVKAAWGKVGA 
H/AGEYGAEALERMFLSFPTTKIQIP 
LSWSLGGHASCPLG 

716 

6213 

B 

740 

12 

298 

MVLSPADKTNVKAAWDLLPALRPE 

PRLCQVKGHGKKVADALTNAVAH 

VDDMPNALSALSDLHAHKLRLAW* 

717 

6214 

A 

741 

2 

392 

QTQREPTMVLSPADKTNVKAAWG 

KVGAH/AGEYGAEALERMFLSFPTT 

KTYFPHFDLSHGSAQVKGHGKKVA 

DALTNAV/AHVGGPVNFKLLSHCLL 

VTLAAHLPAEFTP\AVNASLDKFLV 

SVSTVLTSKYR 

718 

6215 

A 

742 

623 

1235 

SNLVELSNTLSWSSGGKVGAHAGE 

YGAEALERMFLSFPTTKTYFPHFDL 

SHGSAQVKGHGKKVADALTNAVA 

HVDDMPNALSALSDLHAHKLRVDP 

VNF\KLL\SH\CLLVDPGPAHFPAEF\ 

TPAVHASLDKSTKTYFPHFDLSHGS 

AQVKGHGKKVADALTNAVAHVDD 

MPNALSALSDLHAHKLSVDPGNFK 

LPSHLPAGDPC 

719 

6216 

A 

743 

117 

403 


720 

6217 

C 

744 

62 

370 

MKSMRKQAPIITAFILTSRSKGNWIP 









KLSASVNASLKIPVQCLEILPTTHCS 
SRDLIFQKFNLLMNQYLIYLGMLSV 
DTEEDTQLASLFPGEKHSSVSFVCP* 

721 

6218 

A 

745 

3 

1242 

AAPQAGLSPVAIAAAIQLHLHSTQC

SSPNTCCLPRRTRATIYYSRWSYHP 

LGSVP*SP*PFQEAS/ALTLPPACSFY 

GPLT*FQPKP*GSFPLSQ*MEYTIGL

YT*TFHCPGTSRRQIPSSYLNCKDAF 

LPLL/SNPPQCRPFTGVGLVDVLTGF 

ETNNKYEIKNSFGQRVYFAAEDTD 

CCTRNCCGPSRPFTLRIIDNMGQEVI 

TLERPLRCSSCCCPCCLQEIKSLDEQ 

CWGKISKYWTGILREAFTDADNFG 

IQFPLDLDVKMKAVMIGACFLIDRN 

CSPAMEQSWMENYFDEMTEIGFRR 

SVITNFSELKEHVLTHCKEANKNLD 

KMLDEWLTRKNSVEKTLNELMEV 

KTINEKLTIGKISKYWSGFVNDVFT

NADNFGIHVPADLDVTVKAAMIGA 

CFLFAFRLGSELHN 

722 

6219 

A 

747 

129 

1235 

EGCAAAVPDSLEAQKRKPSPGPGSL 

DLVSLGSGNSGSQRTVLIMDKQNS 

QMNASHPETNLPVGYPPQYPPTAFQ 

GPPGYSGYPGPQVSYPPPPAGHSGP 

GPAGFPVPNQPVYNQPVYNQPVGA 

AGVPWMPAPQPPLNCPPGLEYLSQI

DQILIHQQIELLEVLTGFETNNKYEI 

KNSFGQRVYFAAEDTDCCTRNCCG 

PSRPFTLRIIDNMGQEVITLERPLRCS 

SCC\CPCCLQEIEIQAPPGVPIGYVIQ 

TWHPCLPKFTIQNEKREDVLKISGP

CWCSCCGDVDFEIKSLDEQCVVG 

KISMCHWTGILREAFTDADNFGIQFP 

LDLDVKMKAVMIG\ACFLI\DFMFF\ 

ESTGQPGNKNSGVWWGFS 

723 

6220 

A 

748 

647 

797 


724 

6221 

A 

749 

2 

424 


725 

6222 

A 

750 

2 

460 

ARAHTHREPTMVLSPADKTNVKAA 

WGKVGAHAGEYGAEALERMLLSF 

PTTPTYFPHFDLNHGSAHVKGHGK 

NVDDALTNAVTHVYYMPNSLYALS 

DLHPHNLRMDPVNFMLLSHCLL*T 

LVVHLPAELTPAVHASLNNVLESER 

TELTSSTS 

726 

6223 

A 

751 

1 

456 

RPRRPQREPTMVLSPADKTNVKAA 

WGKVGAHAGEYGAEALE/RMFL/SF 

PTTKTYFPHFDLSHGSSQVKGHGKK 

VADALTNAVGHVDDMPNALSALS 

DLHAHKLRVDPVNFKLLSHCLLVT 

LAAHLPAEFTPAVHAFLDKFLASVS 

TVLTSKYR 

727 

6224 

A 

752 

1 

594 

PRLFWSPQTQREPTMVLSPADKTN 

VKAAWGKVGAHAGEYGAEALER 

MFLSFPTTKTYFPHFDLSHGF\AQVK 

GATGKKVDD\ALTKRRGAPLDDMP 

NALVRPLKRPCTTHKAFGVEPGSTS 

KLUSHLPCLGEPWAAHLPRPSFNP 

WRLQRLPWGQSFLGFLVEEPLLEPS 









JMr V JsJV WKPS VGHCFFAP WGFPPAP 

728

6225

A

753

9 

Z 

3R6 


729

6226

A

754

33 

476 


730

6227

A

755

j 

417 

HI/ 


731

6228

A

756

1

419 
*+ 1Z 


732

6229

A

757

o 
z 

446


733

6230

A

758

/JO 

j 

711 


734

6231

A

759

87 
o/ 

ZJO 


735

6232

A

760

1 fit 

Jzz 


736 

6233 

A 

761 

213 

427 


737 

6234 

A 

762 

213 

422 


738 

6235 

A 

763 

1 

732 


739 

6236 

A 

764 

31 

1074 

TLILSGFTVKQVYAIDQIFSSLRLTIT 

IKMFCGDYVQGTIFPAPNFNPIMDA 

QMLGGALQGFDCDKDMLINILTQR 

CNAQRMMIAEAYQSMYGRDLIGD 

M\REQLSDHFQDVMAGLMYPPPLY 

DAHELWHAMKGVGTDENCLIEILA 

SRTNGEIFQMREAYCLQYSNNLQE 

DIYSETSGHFRDTLMNLVQGTREEG 

YSDPAMAAQDAMVLWEACQQKTG 

GHKTMLQMILCNKSYQQLRLVFQE 

FQNISGQDMVDAINECYDGYFQELL 

VAIVLCVRDKPAYFAYRLYSAIHDF 

GFHNKTVIRILIARSEIDLLTIRKRYK 

ERYGK\SLFHDIRNF\ASGHYKKSTG 

LPIC 

740 

6237 

A 

765 

613 

926 


741 

6238 

C 

766 

79 

405 

MIGGTPQMFFISGAKGQWSPSLQPP 

PRAHRSSPWAPSSKSTSGGTAALGS 

LGSKDYFPRTGDGWELRRSDQRR 

AHLPGCPTVLRTLLPQQRGDRDLQ 

QLRHHELRSL* 

742 

6239 

A 

767 

I 

321 


743 

6240 

A 

768 

110 

431 


744 

6241 

B 

769 

756 

1533 

MREIVHIQAGQCGNQIGAKFWEPW 

KASSIELSQCRNSPSKVFRSKEHDGL 

PVTPPTRR* 

745 

6242 

A 

770 

20 

453 

GIPGSTISLFCSEKKLREVERIVKAN 
DREYNEKFQYADNRIHTSKYNILTF 
LPINLFEQFQRVANAYFLCLLILQLI 

TVT^TOOf * I'll nrvrTTl PWV V n T rtrm /m a ■« r-wr-r—* 

PEISSLTWFTTIVPLVLVITMTAVKD
ATDDMLQNEKWMNVKVGDIIKLEN 
NQFVAADLLLLSSSEPH 

746

6243 

A 

771 

1 

1014 


747 

6244 

A 

772 

128 

2654 

LVQDHKAGEHQVGAMARLGNCSL 

TWAALIILLLPGSLEECGHISVSAPIV

HLGDPITASCIIKQNCSHLDPEPQIL

WRLGAELQPGGRQQRLSDGTQESII 

TLPHLNHTQAFLSCCLNWGNSLQIL 

DQVELRAGYPPAIPHNLSCLMNLTT 

SSLICQWEPGPETHLPTSFTLKSFKS 

RGNCQTQGDSILDCVPKDGQSHCCI

PRKHLLLYQNMGIWVQAENALGTS 

MSPQLCLDPMDVVKLEPPMLRTMD 

PSPEAAPPQAGCLQLCWEPWQPGL 









HINQKCELRHKPQRGEASWALVGP 

LPLEALQYELCGLLPATAYTLQIRCI 

RWPLPGHWSDWSPSLELRTTERAP 

TVRLDTWWRQRQLDPRTVQLFWK 

PVPLEEDSGRIQGYWSWRPSGQAG 

AILPLCNTTELSCTFHLPSEAQEVAL 

VAYNSAGTSRPTPWFSESRGPALT 

RLHAMARDPHSLWVGWEPPNPWP 

QGYVIEWGLGPPSASNSNKTWRME 

QNGRATGFLLKENIRPFQLYEIIVTP 

LYQDTMGPSQHVYAYSQEMAPSH 

APELHLKHIGKTWAQLEWVPEPPEL

GKSPLTHYTIFWTNAQNQSFSAILN 

ASSRGFVLHGLEPASLYHIHLMAAS 

QAGATNSTVLTLMTLTPEGSELHHL 

GLFGLLLLLTCLCGTAWLCC\APTG 

RIPSGQVSQTQLTAAWAPGCPQSW

RSCPDPDRDSGWGRHLK*AVLSPHI 

LVCRMPSSCPALARHPSPSSQCWRR 

MKRSRCPGSPITAQRPVASPLWSRP 

K ifPPD PTACAITIJDCDMDOI ADA ID OTTA A 

MCoKG 1 QfcQr rroriNroLAr AlKorM 

GSCWAAPQAQGQGTISAVTPLSPS 

WRASPPAPSPMRTSGSRPAPWGPW 

748 

6245 

A 

773 

123 

2486 


749 

6246 

A 

774 

128 

2573 

LVQDHKAGEHQVGAMARLGNCSL 

TWAALIILLLPGSLEECGHISVSAPIV 

HLGDPITASCIIKQNCSHLDPEPQIL 

WRLGAELQPGGRQQRLSDGTQESII 

TLPHLNHTQAFLSCCLNWGNSLQIL 

DQVELRAGYPPAIPHNLSCLMNLTT 

SSLICQWEPGPETHLPTSFTLKSFKS 

RGNCQTQGDSILDCVPKDGQSHCCI 

PRKHLLLYQNMGIWVQAENALGTS 

MSPQLCLDPMDVVKLEPPMLRTMD 

PSPEAAPPQAGCLQLCWEPWQPGL 

HINQKCELRHKPQRGEASWALVGP 

LPLEALQYELCGLLPATAYTLQIRCI 

RWPLPGHWSDWSPSLELRTTERAP 

TVRLDTWWRQRQLDPRTVQLFWK 

PVPLEEDSGRIQGYWSWRPSGQAG 

AILPLCNTTELSCTFHLPSEAQEVAL 

VAYNSAGTSRPTPWFSESRGPALT 

RLHAMARDPHSLWVGWEPPNPWP 

QGYVIEWGLGPPSASNSNKTWRME 

QNGRATGFLLKENIRPFQLYEIIVTP

LYQDTMGPSQHVYAYSQEMAPSH 

APELHLKHIGKTWAQLEWVPEPPEL 

GKSPLTHYTIFWTNAQNQSFSAILN 

ASSRGFVLHGLEPASLYHIHLMAAS 

QAGATNSTVLTLMTLTPEGSELHHL 

GLFGLLLLLTCLCGTAWLCC\APTG 

RIPSGQVSQTQLTAAWAPGCPQSW 

RRMPSSCPALARHPSPSSQCWRRM 

KRSRCPGSPITAQRPVASPLWSRPM 

pQpryrrnsnFPPWWPQT APAtpc?pK/fn 

SCWAAPQAQGQGTISAVTPLSPSW 
RASPPAPSPMRTSGSRPAPWGPW 

750 

6247 

A 

775 

151 

273 




751 

6248 

A 

776 

785 

920 


752 

6249 

A 

777 

332 

473 


753 

6250 

A 

778 

264 

387 


754 

6251 

A 

779 

257 

354 


755 

6252 

A 

780 

101 

290 


756 

6253 

A 

781 

21 

215 


757 

6254 

A 

782 

158 

955 

KMTSSSEQEEDEKNNQSATPRQTGP 

ATTMNSKGQYPTQPTYPVQPPGNP 

VYPQTLHLPQAPPYTDAPPAYSELY 

RPSFVHPGAATVPTMSAAFPGUSL 

YLPMAQ\SVAVGPL\GSTIPMAYYP 

VGPIYPPXGSTWLGGKGGYDAGARF 

GAGATAGNIPPPPPGXCPPNAAQLA 

VMQGANVLVTQXRKGNFFMGGSDG 

GYTHLVRNQGHLCAREKTSHTLQH 

FSQCNCFSHINLKLQFRHMLLGCLS 

GAQTFRHFSNLIRNHVMVAVPP 

758 

6255 

A 

783 

167 

342 


759 

6256 

A 

784 

368 

525 


760 

6257

A 

785 

311 

487 


761 

6258 

A 

786 

148 

298 


762 

6259 

A 

787 

164 

314 


763 

6260 

A 

788 

232 

382 


764 

6261 

A 

789 

2 

390 


765 

6262 

A 

790 

3 

376 

AQKAGLGTIFIMTCSPLLLTLLIHCT 

GSWAQPVLTQPPSVSAAPGQKVTIS 

CSGSGSNIGNNYVSWYQQLPDPLFH 

AHK*LLPGSRDSGLEAR*QPRQGGS 

GDHHTLQTKQQQVRGQQLPEPDA 

766 

6263 

A 

791 

2 

353 


767 

6264 

A 

792 

2 

382 


768 

6265 

A 

793 

3 

654 


769 

6266 

A 

794

9 

885 


770 

6267 

A 

795 

1 

412 


771 

6268 

A 

796 

2 

616 

WPIEIDIQCGGIPRDNLHHDLLPSPP 

HPSHCPPTRPAVSAEGRTRDQSSSM 

TCSPLLLTLLIHCTGPWAQSVLTQPP 

SVSATPGQRVTISCSGSRSNIGDNYV

SWYKQLPGTAPQLLIYDNNKRTSGI 

PDRFSGSKS\GTSATLGITGLQTGDE

ADYYCGTWDTILSAGVFGGWTKLT 

VLGQPKAAPSVTLFPPSSEELQANK 

AT 

772 

6269 

A 

797 

489 

715 


773 

6270 

A 

798 

20 

371 


774 

6271 

A 

799 

181 

382 


775 

6272 

A 

800 

353 

479 


776 

6273 

A 

801 

3 

368 

HEAASSSSASPFQTKIEKMVDLTQV 
MDDEVFMAFASYATIILSKMMLMS 
TATAFYILTRKVFANPQHCVTFGKG 
ENAKKYLRTDDRV*RVRRAHLNDL 
ENIIPFLGIGLLYSLSGADPSTAI 

777 

6274 

A 

802 

246 

363 


778 

6275 

B 

804 

55 

366 

MGHFTEEDKATITSLWGKVNVEDA 
GGETLGRLLVVYPWTQRFFDSFGN 
LSSASAIMGNPKVKAHGKKVLTSL 
GDAIKHLDDLKGTFAQLPHRLVIVA 









LSSSVK* 

779

6276

A

805

no 



780 

6277 

A 

806 

24 

253 


781 

6278 

A 

807 

32 

433 


782 

6279 

A 

808 

15 

468 


783 

6280 

A 

809 

25 

1404 

APSPDAMGHFTEEDKATITSLWGK 

VNVEDAGGETLGRLLWYPWTQRF 

FDSFGNLSSA\SAIMGNPKVKAHGK 

KVLTSLGDAIKHL\DDLKGTF\AQLE 

*TCTCDKL\H\VDPENFKLLGVNVLV 

TVL\AIHF\GKEFTPE\VQSFLGRKMV 

TGVASALSFPDYH 

784 

6281 

A 

810 

113 

387 


785 

6282 

A 

811 

1330 

1465 

SECCGLSRPGHWPHFI*WLPSL/CLI 

DVPT*QRKGGLVRNWVLPG*NLWE 

LLP/ALAGSGEGHLKNMTGSKLSRM

PNRISDSESE/GVNTARIHGEMFWR 

GDNWACTCCRGARSLSAADSADPA 

TGLTSFPLASASSSATRASIPKRCLN 

SWFSTTRP 

786 

6283 

B 

812 

17 

718 

MVVVAAAPNPADGTPKVLLLSGQP 

ASAAGAPAGQALPLMVPAQRGASP 

EAASGGLPQARKRQRLTHLSPEVPS 

LPRKLKNRVAAQTARDRKKARMSE

LEQQVNQKLLLENQLLREKTHGLV 

VENQELRQRLGMDALVAEDFCLLQ 

SDILLGILDNLDPVMFFKCPSPEPAS

LEELPEVYPEGPSSLPASLSLSVGTS 

SAKLEAINELIRFDHIYTKPLVLEIPS 

DTG* 

787 

6284 

A 

813 

464 

714 


788 

6285 

A 

814 

349 

581 


789 

6286 

A 

815 

223 

513 

DHEEPQAREGDQSVHRPHAERTGQ 
PGMWRHPRLDECQPQELL/TKHSTS 
PSQEKEVHTPHP/RPLESCWASLNR 
DPQHHSSPTPGKTSKSRENKEIISQ 

790 

6287 

A 

816 

384 

464 

PLPQLLRFAQPKPEAHLTPARPQPK 

RTCHGLTCRRGVSPGWRRDGWPRT 

HRSAGATRRPIQETASPVPQPEAAPP 

HRARGSGKMRDGKPGAGNTERRD 

PQSRTVGLNKKNSTPHQSPQPPADV 

*TSAGG

791 

6288 

A 

817 

1 

255 

IVMGHSMLPHF*IW\SPPPGRAAARL 
APLSGAGHSGPRLAPWT*AGQLQT 
QSLVR\P*PELGKSELSAPSLVIGSW 
MDM*PKPGQ 

792 

6289 

B 

818 

191 

1072 

MWRSCLRLRXRGTPSPESAGGWPQ 

RFYESGANHPVSSPGLRPADRKEEV 

LFRMFSIHTGEALAIAVATEWDSQQ

DTIKYYTMHLTTLCNTSLDNPTQRN 

KDQLIRAAVKFLDTDTICYRVEEPE 

TLVELQRNEWDPIIEWAEKRYGVEI 

SSSTSIMGPSIPAKTREVLVSHLASY 

NIWALQGIDGSRPCCCSRLEEEYQI

PEVGGNIEWAHDYELQELRARTAA 

GTLFIHLCSESTTVKHKLLKE* 

793 

6290 

A 

819 

1518 

1891 




794 

6291 

A 

820 

217 

491 


795 

6292 

A 

821 

1789 

2411 

KTYWRKKVEKVVVSNR\LVTSPC\C 

IVTSTYGWTANMGENH*KLQALKE 

TTSTMG/YYMASQRKHRGIKPLTLSI 

IEYLKAKRPEGLIRTDKSWKDLWIL 

LY\ETALLSSGFQSWKIPRHHA*QVS 

YRMIKL\GLGIDEDGPYLLDDTSA\A 

VNLKELPPxLEGDDDTFTHGKEVGLI 

LLGLRGWTLPVSVLYNSSDNIFFQG 

CFPLFLVNI 

796 

6293 

A 

822 

592 

1122 


797 

6294 

A 

823 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

798 

6295 

A 

824 

38 

531 

APSPDA\MGHFTEEDKATIT\SLWGK

VNVEVDAGGETLGRLLVVYPWTQR 

FFD\SFGNLSSASAI\MGNP\KVKAH 

GKKVLTSLGRCHKSTWDDLKG\TF 

AQA*SELHL*QSCNVDPENFK\LLG\

NVLVTRFGQSHFRQKNFTPEGCRAS 

WAE/MMGDLQLASALVPSRYH 

799 

6296 

A 

825 

1 

278 


800 

6297 

A 

826

80 

591 

RGCKREGLSMSSLIRRVISTAKAPG 

A\IGPPTVQAVLV\DRTHLHFRDQIG 

HGPLPSWTSLCPGGVAGRSLNKLL 

KNMGEIPESLPGCDFATNVVKTTCSS 

GLDINDLQLLFNEILQTVFSRSNFPA 

RAAYPSWLLLPQKGSRTAEIEAWAIQ 

GPLTTAFILSGDPCCWWDC 

801 

6298 

A 

827 

1 

396 


802 

6299 

A 

828 

1 

346 


803 

6300 

A 

829 

3 

720 

RGIPASRWARKAWLLCASDLLLLL 

LLLPPAG\SGRAEGSPGTP\DEFTP\PP 

RKKKKDIRDSNDADMARLLEH\WE 

KHDDAEEGDLPEHKRPSAPVDFSKI 

DPG\KPESILKMTKKGKT\LMMFVT 

VSGSPTEKETEEITSLWQG\SLFNAN 

YDVQRFIVGSDRAIFMLRDGSYAW 

EIKDFLVGQDRCADVTLEGQVYPG 

KGGGSKEKNKTKQDKGKKKKEGD 

LKSRSSKEENRAGNKREDL 

804 

6301 

A 

830 

349 

567 


805 

6302 

A 

831 

1098 

1684 


806

6303 

A 

832 

2 

441 

PCRNSRVENFVSMWVCSTLWRVRT 

P\PGSG/GGLLPASGCHGPAASSYSA 

SAEPARVRALVYGHHGDPAKVVET 

VIPGHTWQLRNVA*PTLRR*FERNT 

HSSLDDMNISVWLCA*\LKNLELAA 

VRGSDVRVKMLAAPINPSDINMIQG 

807 

6304 

A 

833 

3 

421 

ASMWVCSTLWRVRTP\PGSG/GGLL 

PASGCHGPAASSYSASAEPARVRAL 

VYGHHGDPAKWEGITRELFQRFP 

WIFLQLITAVISSASTVLKNLELAAV 

RGSDVRVKMLAAPINPSDINMIQGN









YGFLPELPAVGGNEGV 

808 

6305 

A 

834 

2 

611 

ILQLGRGRAVRVCSTLWRVRTP\P\G 

SGG/GLLPASGCHGPAASSYSASAEP 

ARVRALVYGHHGDPAKVVELKNL 

ELAAVRGSDVRVKMLAAPINPSDIN 

MIQGNYGLLPELPAVGGNEGVAQV 

GAEGSNVTGLKPGNWWPAXNAGL 

RTWRNRG*VHPKEALIQVPSDIPLQ 

SAATLGVNPCTAYRMLMDFEQLQP 

GDSVIQNAS 

809 

6306 

A 

835 

159 

312 


810 

6307 

A 

836 

637 

974 


811 

6308 

A 

837 

240 

419 


812 

6309 

A 

838 

20 

283 


813 

6310 

A 

839 

508 

715 

IPGNFEPSRLGRG*KTQACSPSLLWE 
FWLTQYLPALGAG\HILKNFTTFPVI\ 
SCVSKLSTLFGGKMPEN 

814 

6311 

A 

840 

3 

362 


815 

6312 

A 

841 

7 

479 

GAIMGVDIAINKDRRVRRKEPKSQD 

IYLRLLVKLYRFLARRTNSTFNQVV 

LKRLFMSRTNRPPLSLSRMIRKMKL 

PGRENKTAVVVGTITDDVRVQEVP 

KLKVCALRVTSRARSRILRAGGKIL 

TFDQLALD/SPYVRSKGRKFERARG 

RRASRGYKN 

816 

6313 

A 

842 

2 

723 

CAVNSAEQRGAIMVSGHLFITKDRK 

VR\RKEP\KSQDIYLRLLVKLYRFLA 

RRTNSTFNQWLKR\LFMSPHQPGP 

PLSLSXRMIPED*SFPGPGKQRRAW

VG\TITD\DVRVQE\VPKTERVCCTC 

AVDQAGAPQAAILARAGGQDSFTFR

PSLALGTSPKGLVGTCSWLFRFPRQ 

RGPRRWYPAIFGKGPQGTPAQATP 

KPYV\RSKGPKFERARG\RRAS\RGY 

KKLTLDPTLLLKKFLPDKKK 

817 

6314 

A 

843 

1221 

2238 

EPLIVCVCFFCLCPPLFFFSFLGSAEK 

AVLEQFGFPLTGTEARCYTNHALSY 

DQAKRVP\RWVL\EHIFQKAR*\MG 

DADRKHCKFKPDPNIPTTFSAFNEN 

YVGSGWSRGHMAPAGNNKFSSKA 

MAETFYLSNIVPQDFDNNSGYWNRI 

EMYCRELTERFEDVWVVSGPLTLP 

QTRGDGKKIVSYQVIGEDNVAVPS 

HLYKVILARRSSVSTEPLALGAFVV 

PNEAIGFQPQLTEFQVSLQDLEKLS 

VLVFFPHLDRTSDIRNIGSVDTCKLL 

DFQEFTLYLSTRKIEGARSVLRLEKI 

MENLKNAEIEPDDYFMSRYEKKLE 

ELKAKEQSGTQIRKPS 

818 

6315 

A 

844 

1 

306 


819 

6316 

A 

845 

216 

339 


820 

6317 

A 

846 

425 

553 


821 

6318 

A 

847 

190 

334 


822 

6319 

A 

848 

241 

435 


823 

6320 

C 

849 

280 

450 

MLEKNWCPSLQVPIILNWAQPCGKI 
LTECCTLGYSLIQGDFWTFIRKHAR 
TRLVKR* 




824

6321

A

850

1

JXJI 


825 

6322 

A 

851 

2 

3484 


826 

6323 

B 

852 

225 

326 

MAFKDTGKTPVEPEGAIHRIRITLTS 
KKRKarEK* 

827 

6324 

A 

853 

348 

515 

AFKDTGKTPVE\PELAIHRI\RITLTS\ 

RNVKSLEKWSAFVMRGGGGIGRK 

ATSFTR 

828 

6325 

A 

854 

42 

529 

SARSLLHDSPHVRSRRGTKSVRKPA 

RNRPLAFKDT\GKTPVEPEV\AIH\RI 

RITPNKAANVK\SLEKVVCLTLIRRA

QKEKNFQS*KGPVS/RLPYPRFLRIH

FQGKTPCGLKVFKDVGVRFPRWRI

HK\RLI\DLHSPS\EIVKQITFHQYLSP 

GVEVEVHHLQML 

829 

6326 

A 

855 

14 

345 


830 

6327 

A 

856 

1 

396 


831 

6328 

A 

857 

3 

718 

RGIPASRWARKAVVLLCASDLLLLL 

LLLPPAG\SGRAEGSPGTP\DEFTPPP 

RKKKKDIRDSNDADMARLLEH\WE 

KHDDAEEGDLPEHKRPSAPVDFSKI 

DPSKPESILKMTKKGKT\LMMFVTV 

SGSPTEKETEEITSLWQG\SLFNANY 

DVQRFIVGSDRAIFMLRDGSYAWEI

KDFLVGQDRCADVTLEGQVYPGKG 

GGSKEKNKTKQDKGKKKKEGDLK 

SRSSKEENRAGNKREDL 

832 

6329 

A 

858 

80 

349 


833 

6330 

A 

859 

504 

738 


834 

6331 

A 

860 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLAIHFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

835 

6332 

A 

861 

38 

608 

APSPDA\MGHFTEEDKATITSLWGK\ 

VNVE\DAGGETLGRLLWYPWTQR 

FFD\SFGNLSSASAI\MGNP\KVKAH 

GKKVLTSLGDAIKHLDDLKG\TFAQ

L\SELH\CDK\LHVDPENFKLLGEML 

LVTVNLAIPFRAKEFTPEGCRASWQ 

KQKMAEDGDLQWPSGPVPPDTTEA 

a WPMN bEAFKDKAFIL ASN YK 

836 


A 

VUJ 

797 



837 

6334 

A 

864 

432 

742 


838 

6335 

A 

865 

184 

352 


839 

6336 

A 

866 

204 

394 


840 

6337 

A 

867 

1 

2286 

MDLLGRVGSDWALQSSCLTDPELW 

GWEGTPRFLAAAAQGFGGPVLKAQ 

ACSLGAGIAPTELPRPVRWSLLFLA 

VRSNYQALWPQSPAGLPLVPQPETP 

RGANIPSVPVVHAGDDRGWHMTV 

EQKFGLFSAEIKEADPLAASEASQP 

KPCPPEVTPHYTWIDVRACSPTKAV 

U^ol WuAKI VrUVCjVAEPKAFGKL 

GQSAQNPSSAVSAGPRFLVQRFEIA 

KYCSSDQVEIFSSLLQRSMSLNIGRA 

KGSMNRHVAAIGPRFKLLTLGLSLL 









HADVVPNATIRNVLREKIYSTAFDY 

FSCPPKFPTQGEKRLREDISIMIKFW 

TAMFSDKKYLTASQLVPPADIGDLL 

EQLVEENTGSLSGPAKDFYQREFDF 

FNKITNVSAIIKPYPKGDERKKACLS 

ALSEVTVQPGCSLPSNPEAIVLDVD 

YKSGTPMQSAAKAPYLAKFKVKRC 

GVSELEKEGLRCRSDSEDECSTQEA 

DGQKISWQAAIFKLGDDCRQDMLA 

LQIIDLFKNIFQLVGLDLFVFPYRVV 

ATAPGCGVIECIPDCTSRDQLGRQT 

DFGMYDYFTRQYGDESTLAFQQAR 

YNFIRSMAAYSLLLFLLQIKDRHNG 

NIMLDKKGHIIHIDFGFMFESSPGGN 

LGWEPDIKLTDEMVMIMGGKMEA 

TPFKWFMEMCVRGYLAVRPCLGST 

GDRVQQIESCLGDVQDVAGEAWM 

DVWSLVTIMLDTGLPCFRG/QIKFL 

KHKrSrNMTEREAANFIMKVIQSCF 

JLaNKoKl YInJVUvJY Yv^NUlrY 

841 

6338 

A 

868 

3 

164 


842 

6339 

A 

869 

1 

5340 


843 

6340 

A 

870 

649 

1028 


844 

6341 

B 

871 

1

5823 

MCPVDFHGIFQLDERRRDAVIALGI 

FLIESDLQHKDCWPYLLRLLKGLP 

KVYWVEESTARKGRGALPVAESFS 

FCLVTLLSDVAYRDPSLRDEILEVLL 

QVLHVLLGMCQALEIQDKEYLCKY 

AIPCLIGISRAFGRYSNMEESLLSKL 

FPKIPPHSLRVLEELEGVRRRSFNDF 

RSILPSNLLTVCQEGTLKRKTSSVSS 

ISQVSPERGMPPPSSPGGSAFHYFEA 

SCLPDGTALEPEYYFSTISSSFSVSPL 

FNGVTYKEFNIPLEMLRELLNLVKK 

IVEEAVLKSLDAIVASVMEANPSAD 

LYYTSFSDPLYLTMFKMLRDTLYY 

MKDLPTSFVKEIHDFVLEQFNTSQG 

ELQKILHDADRIHNELSPLKLRCQA

SAACVDLMVWAVKDEQGAENLCI 

KLSEKLQSKTSSKVIIAHLPLLICCL 

QGLGRLCERFPVVVHSVTPSLRDFL 

VIPSPVLVKLYKYHSQYHTVAGNDI 

KISVTNEHSESTLNVMSGKKSQPSM 

YEQLRDIAIDNICRCLKAGLTVDPVI 

VEAFLASLSNRLYISQESDKDAHLIP 

DHTIRALGHIAVALRDTPKVMEPIL

QILQQKFCQPPSPLDVLIIDQLGCLVI 

TGNQYIYQEVWNLFQQISVKASSV 

VYSATKDYKDHGYRHCSLAVINAL 

ANIAANIQDEHLVDELLMNLLELFV

QLGLEGKRASERASEKGPALKASSS 

AGNLGVLIPVIAVLTRRLPPIKEAKP 

RLQKLFRDFWLYSVLMGFAVEGSG 

LWPEEWYEGVCEIATKSPLLTFPSK 

EPLRSVLQYNSAMK^DTVTPAELSE 

LRSTIINLLDPPPEVSALINKLDFAM 

STYLLSVYRLEYMRVLRSTDPDRFQ 

VMFCYFEDKAIQKDKSGMMQCVIA 









VADKVFDAFLNMMADKAKTKENE 

EELERHAQFLLVNFNHIHKRIRRVA 

DKYLSGLVDKFPHLLWSGTVLKTM 

LDILQTLSLSLSADIHKDQPYYDIPD 

APYRITVPDTYEARESIVKDFAARC 

GMILQEAMKWAPTVTKSHLQEYLN 

KHQNWVSGLSQHTGLAMATESILH 

FAGYNKQNTTLGATQLSERPACVK 

KDYSNFMASLNLKNRYAGEVYGMI 

RFSGTTGQMSDLNKMMVQDLHSA 

LDRSHPQHYTQAMFKLTAMLISSK 

DCDPQLLHHLCWGPLRMFNEHGM 

ETALACWEWLLAGKDGVEVPFMR 

EMAGAWHMTVEQKFGLFSAEIKEA 

DPLAASEASQPKPCPPEVTPHYIWID 

FLVQRFEIAKYCSSDQVEIFSSLLQR

SMSLNIGGAKGSMNRHVAAIGPRF 

KLLTLGLSLLHADVVPNATIRNVLR 

EKIYSTAFDYFSCPPKFPTQGEKRLR 

EDISIMIKFWTAMFSDKKYLTASQL 

VPPDNQDTRSNLDITVGSRQQATQG 

WINTYPLSSGMSTISKKSGMSKKTN 

RGSQLHKYYMKRRTLLLSLLATEIE 

RLITWYNPLSAPELELDQAGENSVA 

NWRSKYISLSEKQWKDNVNLAWSI 

SPYLAVQLPARFKNTEAIGNEVTRL

VRLDPGAVSDVPEAIKFLVTWHTID

ADAPELSHVLCWAPTDPPTGLSYFS 

SMYPPHPLTAQYGVKVLRSFPPDAI 

LFYIPQIVQALRYDKMGYVREYILW 

AASKSQLLAHQFIWNMKTNIYLDE 

EGHQKDPDIGDLLDQLVEEITGSLS 

GPAKDFYQREFDFFNKITNVSAIIKP

YPKGDERKKACLSALSEVKVQPGC 

YLPSNPEAIVLDIDYKSGTPMQSAA

KAPYLAKFKVKRCGVSELEKEGLR 

CRSDSEDECSTQEADGQKISWQAAI 

FKVGDDCRQDMLALQIIDLFKNIFQ 

LVGLDLFVFPYRWATAPGCGAIEC 

IPDCTSRDQLGRQTDFGMYDYFTR

QYGDESTLAFQQARYNFIRSMAAY 

SLLLFLLQSKDRHNGNIMLDKKGHI 

IHIDFGFMFESSPGGNLGWEPRHQA 

DG* 

845 

6342 

A 

872 

1 

337 


846 

6343 

A 

873 

1 

337 


847 

6344 

A 

874 

838 

929 51 


848 

6345 

A 

875 

21 

338 


849 

6346 

A 

876 

2 

424 


850 

6347 

A 

877 

3 

452 


851 

6348 

A 

878 

3 

604 

PTLLVPTDSERTHPWLLSPADKYTN 

VKAXAWGKVGAHAGEYGAEALER 

MFLSFPTTKTYFPHFDLSHG\SAQV\ 

KGHG\KKVADALTNAVAHV\DDMP 

N\ALSALSDLHAHKL\RVGPGSTFKL 

LK/HTCLAGEPWAAHLP\AEFQPLA 

VATSSLGTKFPGLLVEAPLLTFQIPV 

KAGSLGWPLFFCPLGLPPSPSSPFLH 




PYPRGL 


852 


6349 


879 


416 


EGKPRTSGAEHRSCRGKASMSPNF 

KLQCHFILIFLTALRGESRYLELREA 

ADYDPFLLFSANLKRELAGEQPYRR 

ALRCLDMLSLQGQFTFTDDRPQLH 

CAGFFISEP\EESLPFHY\DQ*SIDGK 

AGNFLKVLMGRIL 


853 


6350 


880 


187 


854 


6351 


881 


1099 


PRVRGRVGEGVGRKAQDLRSRQHS 

SCRGKASMSPNFKLQCHFILIFLTAL 

RGESRYLELREAADYDPFLLFSANL 

KRDVAGEQPYRRALRCLDMLSLQG 

QFTFTADRPQLHCAAFFISEPEEFITI 

HYDQVSIGLSKGGDF/LWKVFDGWI 

LKGEKFP\SSQ\DHPLPSAERYIDF\C 

ESGLSRRSIRSSQNVNAMIFFRVHEP 

GNGISHLTIKTDPNLFSFAMFISSEFQ 

MGKFNLG*FPHQHRNCSFSIIYPVWI 

KISDLYPGGHVNGSFS*RKSS\AGCE 

GIGDFVELLGGTGLDPSKMTPLADL 

CYPFHGPAQMKVGCDNTVV\RMVS 

SGKHVNRVDFLRIVQLEAVTSWEN 

PNG\NS1GEFC\LSGL 


855 


6352 


882 


645 


HGIQAHGQIPSYKTIGGRDDSFHTFF 

SETGAGKHVPRLLL*NWKPTVMDE 

VRTGTYCQLFHLEQFITARKIAANN 

YARGHYTIGKE1IDLVLDRIRKLAD 

QCTGLQGFLVFHSFGGGTGSGFTSL 

LMERLSVDYGKKSKLEFSIYPAPQV 

STAWEPYNSILTTHTTLEHSDCAF 

MEEGEFSEAREDMAALEKDYEEVG 

VDSVEGEGEEEGEEY 


856 


6353 


883 


90 


1657 


EATTSPLRLRHQLGSREAATMRECI 

SIHVGQAGVQIGNACWELYCLEHGI 

QPDGQMPK*PKPLGEGDDSFNTFFS 

ETGAGKHVPRAVFVDLEPTVIDEVR 

TG\TYRQLFHPEQLITGKEDAANNY 

ARGN\YTIGKEIIDLVLDRIRKLA\DQ 

CTG\LQGFLVFHS\FGGG\TGSGFTS\ 

LLM\ERLSVDYWQESPSLEFSIYPAA 

PRFPQPVVEP\YN\SILPTQHPPWEHS 

DCAVFMWDNEAIYDICRRNLDIERP 

TYTNLNRLIXSQIVSSITASLRFDGAL 

NVDLT\EFQTNLG\PYPPIHFPL\ATY 

APCHLC*RKPTHEQLFCSQRSPKCF 

AFEPTNPDG*NGDPR\HG*1HWLAC 

LLLP/RGDVVPKRCQMLPIAHPSKP 

KRS\IQFVDWCP\TGFKV\GINYQPP\ 

TVVPGGDLA\KVTREAVCMLSKHH 

SPFAEAWARPGPTSFDLMLCQACPF 

VHWYLG\EGMEEGEFSK\ARKDMA 

AL\RKDYEEVG\VDSVKG\EGEEEGK 

GILIIHSLFGPCSMSCSQNFSFSLTDR 

R 


857 


6354 


B 


884 


46 


386 


X1RHESGSRSHSHCSTLSSIGDVAKK 
LGEMWNNTAADDKQPYEKKAAKL 
KEKYEKDIAAYRAKGKPDAAKKG 
WKAEKSKKKKEEEEDEEDEEDEE 









EEEDXEDDDEEEDDDDE* 

858 

6355 

A 

885 

263 

484 


859 

6356 

A 

886 

146 

826 

TWGKGDPKKPRGKMSSYAFFVQTC 

RGG\HKKKHPDASVNFS/ESFSKKCS 

ERWKTMSA*R/EKGKFEDMAKA\D 

KARYNEREMKTYIPPQRGRQKRKFK 

DSQLHPRGPPS\AFFPLLALEYRPKI 

K\GEHP\GL\SIGDVAKKLGRDVGIN 

I AADUJKQPYEKKvAAKLKEKYEKD 

IAAYRAKGKPDAAKKGWVKAEKS 

KKKKEEEEDEEEG\DEEDEEEEEDE 

EDEEDEEEDER 

860 

6357 

A 

887 

1 

456 

RPRRPQREPTMVLSPADKTNVKAA 

WGKVGAHAGEYGAEALE/RMFL/SF 

PTTKTYFPHFDLSHGSSQVKGHGKK 

VADALTNAVGHVDDMPNALSALS 
DLHAHKLRVDPVNFKLLSHCLLVT 
LAAHLPAEFTPAVHAFLDKFLASVS 
TVLTSKYR 

861 

6358 

A 

888 

2 

435 

QTQREPTMVLSPADKTNVKAAWG 

KVGAH/AGEYGAEALERMFLSFPTT 

KTYFPHFDLSHGSAQVKGHGKKVA 

DALTNAV/EHVDDMPNALSALSDL 
HAHKLRVDPVNFQAPKATGLLVDP 
GPAHFPGRVSPLRLQGFLGTKFLGF 
C 

862 

6359 

A 

889 

9 

390 

NSARATDSERTHHGARLLPDKTNV 
KA\AWGKVGAHAGEYGAEALERM 
FLSFPTTK\TYFPHFDL\SHG\SAQ\VK 
OF1AKK.VAERADQTPWRNVDDMP 

VDDrDDtCni ij\AMt/i VO\/T\D17nT CO 

KJ<KCrr*bULH\AHKL\KVL)FV(jLbS 
S*SHLPCW 

863 

6360 

A 
A 


Z 

All 


864 

6361 

A 

891 

2 

6281 


865 

6362 

B 

892 

79 

200 

XGDYPLGDLTPTTMEEATSGVNESE 
MAVASGHLNSTGVLLE* 

866 

6363 

B 

893 

209 

502 

MLLMYNSSDHDVYHMAVEMQRD 
VLEQIQQFLATQLIMQTSESGISAKS 
LRGRDSTRKQDASEKDSVPMGSPA 
FFSLSLWDTSGFGWILNKIIPMTLS* 

867 

6364 

A 

894 

283 

340 


868 

6365 

B 

895 

1649 

1741 

MSFAMTLKKKLEEEEAEVKRKATD 

AAYQARQAVKTPPRRLPTVMVRSP1 

DSASPGGDYPLGDLTPTTMEEATSG 

VTPGTLPSTPVTSFPGIPDTLPPGSAP 

LEAPMTPVTDDSPQKKMLGQKATP 

PPSPLLSELLKKGSLLPTSPRLVNES 

EMAVASGHLNSTGVLLEVGGVLPM 

IHGGEIQQTPNTVAASPAASESVSQ 

ATIVMMPALPAPSSAPAVSTTESVA 

PVSQPDNCVPMEAVGDPHTVTVSM 

DSSEISMIINSIKEECFRSGVAEAPVG 

SKAPSIDGKEELDLAEKMDIAVSYT 

A V fcAALor CbblN UUr VefoLr Or Wbrii* 

IQQERDKPVPLPAPEMTVKQERLDF 

EETENKGIHELVDIREPSAEIKVEPA 

EPEPVISGAEIVAGVVPATSMEPPEL 









RSQDLDEELGSTAAGEILEADVAIG 
KGDETPLTNVKTEASPESMLSPSHG 
SNPIEDPLEAETQHKFEMSDSLKEES 
GT1FGSQIKDAPGEDEEEDGVSEAA 

ESDDGFSIHNATLQSHTLADSIPSSP 
ASSQFSVCSEDQEAIQAQKIWKKAI 
MLVWRAAANHRYANVFLQPGTR* 

869 

6366 

A 

896 

3 

2926 

PGSTISSGTGKHKLLSTGPTEPWSIR 

EKLCLASSVMRSGDQNWVSVSRAI 

KPFAEPGRPPDWFSQKHCASQYSEL 

LETTETPK*VQSQ\RKRGEKGEWE 

TVEDVIVRKLTAERVEELKKVIKET 

QERYR\RLKRDAELIQAGHMDSR\L 

DELCNXDIATKKKLEEEEAEVKRKA 

TDAAYQARQAVKTPPRRLPTVMVR 

SPIDSASPGGDYPLGDLTPTTMEEA 

TSGVTPGTLPSTPVTSFPGIPDTLPPG 

SAPLEAPMTPVTDDSPQKKMLGQK 

ATPPPSPLLSELLKKGSLLPTSPRLV 

NESEMAVASGHLNSTGVLLEVGGV 

LPMIHGGEIQQTPNTVAASPAASGA 

PTLSRLLEAGPTQFTTPLASFTNWA 

SNKPPVKLVPPPVEFFSQATIVMMPA 

LPAPSSAPAVSTTESVAPESQPDNC 

VPMEAVGDPHTVTVSMDSSEISMII 

NSIKEKCFRSGVTEAPVGSKAPSIDG 

KEELYLAEKMEIAVSYTGEELDFET 

VGDIIAI1EDKVDDHPEVLDVAAVE 

AALSFCEENDDPQSLPGPWEHPIQQ 

ERDKPVPLPAPENMTVKQERLDFEE 

TENKGIHELVDIREPSAEIKVEPAEP 

EPVISGAEIVAGVVPATSVMEPPELR 

SQDLDEELGSTAAGEIVEADVAIGK 

GDETPLTNVKTEASPESMLSPSHGS 

NPIEDPLEAETQHKFEMSDSLKEES 

GTIFGSQIKDAPGEDEEEDGVSEAA 

SL*EPKEEDQGEGYLSEMDNEPPVS 

ESDDGFSIHNATLQSHTLADSIPSSP 

ASSQFSVCSEDQEAIQAQKIWKKAI 

MLVWRAAANHRYANVFLQ\PVTD 

DIAPGYHSIVQRPMDLST1KKNIENG 

LIRSTAEFQRDIMLMFQNAVMYNSS 

DHDVYHM\AVEMQRDVLEQIQQFL\ 

RKQDASEKDSVP\MGSPAF\LLSLFD 
GGTQGTPLCPLKPDMKMKKVKPQS 
YPL 

870 

6367 

A 

897 

150 

425 

\/VTJUT \7A1 VTDDCT \4\/PPrT , POP'E r DC/ 

V YHrbVALlSirr'oJLrlVl VrrLLrbrfro/ 
PPRT PPHPVT PPT PPQPQPQ>JP*VT HQ 

PRGLSPPPLLG*GPP\PPKPACFCSFP 
RDPGKLRWALRG 

871 

6368 

A 

898 

65 

259 


872 

6369 

A 

899 

273 

962 

KRERAVSLGQSGLPAVRAGPQGGG 
PTwrsArjAi ocvmACCiAC^ r^tp 

\_* 1 \rvJf\L/f\Li\J\J 1 VJ/\^\J/\wOi^JX.OO 1 i 

HFLGQSERVPH*QGG\TGIFHHPEHG 
S*AKK/DPVRPPCA*QGKGVAFLPA 
EA*VSGDQGGPGAVLSP/GRDCPFPS 


PGP/PGNPQPLAARQGPAPGNSGSL 

WPWOEPPVDWPSEGTP\GPLLROOL 

QSQPKNATGRERHPPQT/AKPFPSCP 

NTVL*1PEIK*NPWGEQQSRPALGST 

QDQRICNNH 

873 

6370 

A 

900 

i 
1 


K R K. VSLCHPG WSAG APSRLTATSSS 
LV\KRFSCLSFPSSWDYRCAPHLAN 
L/CRG/RGFTMLARLVLNS*PQMIYQ 
SRPPKVLGLQV 

874 

6371 

A 

901 

327 

638 

LGLQGSTIFHKTLKKDLLQLEKQLN 

FFGLLRN*PTNTLDRFVFGFENTHLS 

VL/QRKTISFNLVCWSHTPSINVCAI 

YQ 

875 

6372 

A 

902 

834 

1187 

RKYETCLSALE1FT*SCSAVGI1*FFC 

T PI r,nPVI 1 PPPHT PTHPH r-TP WMY 
L,rLt\JLJiZ V LtLi\*\**r VJL»r 1 wV^rirllx WIN I 

SLKLLGSKRSFCLSLLSSWNYRHAP 
PSLGF*KNFKKNFE\KDLAML\PGLV 
FNS\YP*V1LLWASSNG 

876 

6373 

C 

903 

1 CA 


1V/TCIT PT nQVT\IM>J A n*KTI YnOMITMM 
lVlolJOr JUV^O I UNlVllNrVVJlNJL* I XJV^iVIJTirNAN 

FPYIVKQKKQVCRTVCTVSLVYHK 
MCVYMCVCECLXXXXXXXXXXX 
X* 

877 

6374 

A 

904 

29 

372 

SYENNHSYAGWSGSRKRFTLFLQ1Y 

/fDVlTPI VII 1 YVPPn*VWPFKVT* 

I*MQEIYMDRCITIF/LYIVKQKKQV 

CRNSVYSITCLPQNVCGICVYVSVYI 

HTYIYIYTHTHH 

878 

6375 

A 

905 

1 

815 

MGNLGQVRRLSLWDYLLGLTHPRG 

LTTSQPGRSGLSPPAPPQQSFCMCQ 

NVTPGIMALGMSAVYFQVSGTKEQ 

PVPGHPMQSILLELWGFQVHHCVP 

GNPRPDFMEHSKDLTLSLLDHSCH 

WHGRSHSSKEYLELHRENFLL1LRS 

AFPTGLLRAWPRDGISQYLLVELKN 

NMFRFLVAGSAEGAAGPPCPGPRK 

V AlviSkftj riL>Pwy/\r rviN/\\jr xsjv w jl-'cvj 

R*GFPSQKQKEEQKKLGGA*KRKA 

RGGRGPWPTGG1KVKSGQKSKLFPW 

CLRRW 

879 

6376 

A 

910 

140 

512 

PARGEGSRLDPSQWGEPASCAKEPT 

GARCIIGGSAPSTPPSSARRRWPGG 
HS*AGRPGRSSRQEPGCCIDRAPGP 
GLPPPASQPPGAAPLRCPTAVGPS 

880 

6377 

A 

911 

68 

675 

RSTRTVHIPLLSCAQLPGQTP*PLSP 

WWFFCTPSSQGPPEPREDQPGCAPG 

PQEAPKPAGNLPPTDSSARAASETG 

RVLPS/PPTLIFCNLPRRG/FVSVAHL 

WLMSPFIRL*EATPGPGGQSGDLGG 

GP/DTSPTPCSRAAAAGMPTA*TLTP 

*RILPRTAPSPTTPGEQLPRPGNSGR 

DG 

881 

6378 

A 

912 

3 

3492 

GGTVPQGLRTHGTGRGDTVGDDGE 

PPPQDRTLHLPQPPHPLPAPGQGAV 

PAGRGGGAAQP/AGSPTAPCGPGTxS 









GFAEDSREERGHRLPGEPEVPQP*R 
LHPG/PPGC/MPDVDFSNFSGESSDF 
DGLAGTSRN/RQAPGNPRSHGDIQA 
DRVPGWGHRQPAGGAEPGKGAEG 
GAAAAVPAAAGAPGPGRDPCRGPA 
PAGG*PQPHEA*G*RTLP*GAEAEG 
RDAQPLAAL*QCAAGEGAGRLTLP 
QPAGGAVSTEAGAAASQHGFLL*A 
GIARAVPEDSQRPGVRG*GAEPPEG 
GE*ETALADFQPGGEGHSGAEPGRG 
AGEPTGAGGAHPLAAGAGRGCREA 
ARAGQTLRAAELHGPCVPLCPVLG 
REGTDPAAVPEE*DGLPTLQGEGEC 
AAGPGVRAAEGARPGVLREGQCSE 
GDFPEPGGEGLPPQAGVRADGPGL 
RAAHTASPAAGRASGCAQAG/RPGP 
GSPVHGRSSGWCGCMPSAPETTAT 
A\PRQLHRVSALVGPECHVQPRAGG 
QLPLQQPRAPQPAVPVQAGGRGLR 
GRTLVFQQLPGDPGGRPGSPAGS*G 
RRPTPGL*APRHGRPSAAGKQPAAS 
LPWKA*CLGECTSRSSPGLQRRPHA 
AEASPQDPEPGHHAGVPGGCIAGA 
DQRHRREPHGHLHPPGHPGLGGGP 
DGLAPGHPDCDG*LRSLRALVQGSP 
GGHDPGGGRGASQEGGRLLLPVCE 
GQHGRL*EATPGPGGQSGDLGGLIL 
HPGQPGHGGQGQRGAAGALQRGP 
AR/PPTPCSRAAAAGMPTA*TLTP*R 
ILPRTAPSPTTPGLSSSS*PSSRT*LSS 
AP*PASHL/PGGPQ/IAGPHRQYGQS 
QGQPSAFVL*QGPVGPQQDGGLQH 
VLLGRELPHPGALYPGAAPSTRPAP 
ACAPRAQGGWEDPERETVPPPRV*E 
VPGRVLEPGGV*GLEP\GGDIIPGGR 
GVRGPLLGEPANSCGVPHGKEHPCP 
PGRPAGQCLHPAQDGHLPHRHPRL 
CQREDGKEAQEGPTAVGHLRGAAP 
GGCEAGGGRPGPGALSIQQPGS*RL 
ERPGRPAQLCPPGHRRRAEEGATPT 
SSKN*PQARGRASPPSNASVTEELT 
QGRGWALPPSNASVTEELTQARGR 
ASPPSNASVTEELTQARGRASPCLH 
LRRLSKKDKLLPRNTTGSKLITSGSL 
LPISWKPAWGTGT 

882 

6379 

A 

913 

232 

485 

TRLRLTPKVCPYRWSHFDRKFLSRV 
LMRRSAQKSRDRILNVFHELNL/NS 
VLDMRPMEF*GLRAAS*PQGERRGS 
LAFIREFHHT 

883 

6380 

A 

914 

2 

1163 


884 

6381 

A 

915 

771 

1597 

GACHLRLTPKVCPYRWSHFDRKFP 

SARVL\MRRSAQKSS/RDRILNVFHE 

LVNLKDAISYVAEVAEPLALPGRGC 

SRLGHWLIQFWT*GQWSFRVSGLLP 

D/TQGERRGSLAFIRSPSTDNVVNV 

DFTPRSSTVEASVSYLLYVAMVMQ 

LPWGRAQPRELRVTDRAVVAPGLG 

VAWKRGEVQKEGVGVSSHKPSYIR 
PWPDSLSAGRKVKGRGSSGLGARP 
DVFAPGPQQPVMVPMPPLLLLRPW 
APQLTASSHRRSTLPDVQMLGSPSL 
TARALERDQ 

885 

6382 

A 

916 

3 

471 

DSWLWWLRQRLQQIGGISGSTSTSS 

MLSRAVCGTSRQLAPVLGYLGSRQ 

KHSLPDLPYDYGALEPHINAQIMQL 

HHSKHHAAWNNLNVHEEKYQEA 

LAKGELLEAIKRDFGSFDKVKEK\L 

TAASVGGKGSCWGGLGFNKERGH 

LQIAAWPNQDP 

886 

6383 

A 

917 

54 

873 

GPRAAQERHSWLWWLRQRLQQIG 

GISGSTSTSSMLSRAVCGTSRQLAP 

VLGYLGSRQKHSL\PDLPYDYGAL\ 

EPH1NAADHASLHHSKHHA/APYVN 

NLNVXTEEKYQGGLWPRGDVYSPR* 

ALQPCT*KF\NGGG\HIN\HSIF\WT\N 

PQAPNGGGETQRGSLLGSHQNVDF 

GS\FDKFK\EKLTAASVGCPKAPGW 

GW\LGFQ*GNRGH/LYQ1AACPKSGI 

PLQGTTG/LLFPLLGIDVWEHALLPS 

SIKNVRPDYLKAIWNVI\NWENVTE 

RYMACKK 

887 

6384 

A 

918 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGE 

TLGRLLVVYPWTQRFFDSFGNLSSA 

SAIMGNPKVKAHGKKVLTSLGDAI 

KHLDDLKGTFAQLSELHCDKLHVD 

PENFKLLGNVLVTVLA1HFGKEFTP 

EVQASWQKMVTGVASALSSRYH 

888 

6385 

A 

919 

41 

601 

APSPRRPWGHFTEEDQGLLSTSLWG 

KWKCGKNAGRKKPLGKAPLVVL/H 

PWDPKRSFEQALGNPVPLPSAMMG 

NPPKSRAHGK\KVLT\SLGEMPIKHP 

G*SSKGTFAQA*SELH\CDK\LHVDP 

ENFK\LLG\NVLVT\VL\AIPFSAKEFT 

PGGCRASWAERWVTWSWPVPCSS 

RIPLSSLAHDCRAFQG 

889 

6386 

A 

920 

14682 

14931 

E1GPGPRPLPSPLP*ATSTSVLAASGR 

PERTRXHAGIKIVLEDIFTLWRQVET 

KVRAKIRKMKVTTKVNRHDKINGK 

RKTAKEQSPLLQESLFATGDVSHNL 

LRALDVGLLANLSALAELDISNNKI 

STLEEGIFANLFNLSEINLSGNPFEC 

DCGLAWLPRWAEEQQVRVVQPEA 

ATCAGPGSLAGQPLLGIPLLDSGCG 

EEYVACLPDNSSGTVAAVSFSAAHE 

GLLQPEACSAFCFSTGQGLAALSEQ 

GWCLCGSAQPSSASFACLSLCSGPP 

PPPAPTCRGPTLLQHVFPASPGATLL 

AAFHIAAPLPVTATRWDFGDGSPEV 

DAAGPAASHRYVLPGRYHVMAVL 

ALGAGSALLGTDVQVEAAPAALEL 

VCPSSVQSDESLDLSIQNRVGSGLE 

AAYSIVALGEEPARAVHPLCPSDTEI 

FSGNGHCYRLVVEKAAWLQAQEQ 

CRAWAGATLAMVDSPAVQRFLVS 

RVTRSLDMWIGFSTVQGVEVGPAP 

QGEAFSLESCQNWLPGEPHPATAEH 




CVRLGPTGWCNTDLCSALHSYVCE 

LRPGGPVQDAENLLVGAPSGDLQG 

PLMPLARQYGLSAPHEPVEVMVFP 

GLRLSREAFLTTAEFGTQELRRPAQ 

LRLQVYRLLSTAGTPENGSEPESRSP 

DNRTQLAPACMPGGRWCPGANICL 

PLDASCHPRPAPMAARQGPGLLGA 

PYALWREFLFSVPAGPPAQYSVTLH 

GQDVLMLPGDLVGLQHDAGPGALP 

HCSPAPGHPGPQAPYLSANASSWLP 

HLPAQLEGTWACPACALRLLAATE 

QLTVLLGLRPNPGLRLPGRYEVRAE 

VGNGVSRHNLSCSFDVVSPVAGLR 

VIYPAPRDGRLYVPTNGSASVLQVD 

SGASATATARWPGGSVSARFENAC 

PALVATFVPGCPWETNDTLFSVVAL 

PWLGEGEHVMDVWENSASRANLS 

LRVTAEEPICGLRATPSPEARVLQG 

VPVRYSPVVEAGSDMVFRWTINDK 

QSLTFQNVVFNVIYQSAAVFKLSLT 

ASNHVSNVTVNYNITVERMNRMQ 

GLRVSTVPAVLSPNATLALTAGVLV 

DSAVEVAFLWTFGDGEQALHQFQP 

PYNESFPVPDPSVAQVLVEHNVTHT 

YAAPGEYVLTVLASNAFENRTQQV 

PVSVRASLPSEAVGVSDGVLVAGRP 

VTFYPHLLPSPGGVLYTWDFGDGSP 

VLTQSQPAANHTYPSRGIYHVRLEV 

NNTVSGAAAQADVRVFEELRGLSV 

DMSLAVEQGAPVWSAAVQTGDNI 

TWTFDMGDGTVLSGPEATVEHVYL 

RAQNCTVTVGAASPAGHLARSLHV 

LVFVLEVLRVEPAACIPTQPDARLT 

AYVTGNPARYLFDWTFGDGSSNTT 

MRGCPTVTrflvJFTRSGTFPLALVLSS 

RVNRARYFTSICVEPEVGNVTLQPE 

RQFVQLGDEARLVACAWPPFPYRY 

TWDFGTEEAVPARVGGPEVTFIYRD 

PGSYLVTVTASNNISAANDSALVEV 

QEPMLVTSIKVNGSLGLELHYLWD 

LGDGGRLEGPEVTHAYNSTGDFTV 

RVAGCNEVSRSEAWLNVTVKRRVR 

GLIVNASCTWPLNGSMSFSTSLEA 

GSDVRYSWVLCDRCTPISGAENEV 

GSAQDSIFVYVLQLIEGLQVVGGGR 

YFPTNHTVQLQAWRDGTNIYSWT 

AWRDRGPALAGSGKGFSLTALEAG 

TYHVQLRATNMLGSAWADCTVDF 

VEPVGWLMVAASPNPAAVNTSVTL 

SAELAGGSGVVYTWSLEEGLSWET 

PEPFTTHSFPTPGLHLVTMTAGNPL 

GSANATVEVDVQVPVSGLSIRASEP 

GGSFVAAGSSVPFWGQLATGTNVS 

WCWAVPGGSSKRGPHVTMVFPDA 

GTFNIRLNASNAVSWVSATYNLTV 

EEPIVGLVLWASSKWAPGQLVHF 

QILLAAGSAVTFRRQVGGASPEVLP 

GPRFSHSFPRIGDHVVSVQSKNHVS 


WAQAQVRIVVLEAVSGLQVPNCCE 

PGIAMGTERNFTARVQRGSRVAYA 

WYFSLQKVRGDSLFELSGRDVTYTP 

WPRGCWRSSENRTLVLEVQDAVQ 

YVALRSGPCFTNRLAQFEAATSPSP 

RRVAYHWDFGDGSPGQDTDKPRA 

EHSYLRPGDYRVQVNASNLVSFFV 

AQATVTVQVLACREPEVDVVLPLQ 

VLMRRSQRNCLDAYVDLRDCVTY 

QTEYRWEVYRTASCQRPGCPARVA 

LPGVDVSRPQLVLPRLALPVGHYCF 

VFVVSFGDTPLARSIQANVTVAPER 

LVPITEGGSYRVWSDTQDLVLDGSE 

SYDPNLEDGDQTPLSFQWACVAST 

QREAGGCALNFGPRGSSTVTIPRER 

LAAGVEYTFSLTVWKAGRKEEATN 

QTCWWRPRALPSLFLMQILCNTTA 

CFSFASFQTCHSSTYSLQATYALVT 

KATQSPSNTNRSSWLQYTRTHTPVS 

SALCMPFRRPGWKVANRMSILGGG 

WHDAEDAGAPLVYALLLQRCCQG 

HCKEFCVYKSSLSGYGAVLPPGFRP 

HFEVGLAVWQDQLGAAWALNR 

SLAITLPEPNGSAMGLTVWLHRLTA 

SVLPGLLRQADPQHVIEYSLALVTV 

LNEGPSRELVCRSCLKQTLHKLEA 

MMRILQAETTAGTVTPTAIGDSILNI 

TGDLIHLASSDVRAPQRSELGAESP 

LRMVASQAYNLTSALMRILTRSRV 

LNEEPAFSRAPANLSDVVQLVFLVD 

SNPFLFGYISNYTVSTKVASMAFQT 

QAGAQIPIERLASERAITVKVPNNSD 

WAARGHRSSANSVWQPQASVGA 

VVTLDSSNPVAVLHLQLNYTLLDG 

RYLSEEPEPYLAVYLHSEPRPNERN 

CSASRRIRPESLQGADHRPYTFFISP 

GTRDPVGSYRLNLSSHFRWSALEVS 

VGLYTSLCQYFSEEDVVWRTEGLL 

PLEETSPRQAVCLTRHLTAFGASLF 

MPPSHVRFVFPEPTADVNYIVMLTC 

AVCLVTYMVMAAILHKLDQLDASR 

GCAIPFCGQRGRFKYEILVKTGWGR 

GSGTTAHVGIMLYGVDSRSGHRHL 

DGDRAFHRNSLDIFQIATPHSLGSV 

WKIRVWHDNKGLSPAWFLQHIIVR 

DLQTARSTFFLVNDWLSVETEANG 

GLVEKEVLAASHAALLRFRRLLVA 

ELQRGFFDKHIWLSIWDRPPRSCFT 

RIQRATCCVLLICLFLGANAVWYG 

AVGDSAYSTGHVSRLSPLSVDTVA 

VGLVSSVVVYPVYLAILFLFRMSRS 

KVLDIDSCLDSSVLDSSFLTFSGLHA 

EVRALLGVLGWAGGPAALAQLGL 

QTLCTSQQAFAGQVKSDLFLDDSK 

RSCjrVPVPFPPPCPKPPPPLSWLPQG 

ALKGPGHAGIKIVLEDIFTLWRQVE 

TKVRAKIRKMKVTTKVNRHDKING 

KRKTAKEQ 

890 

6387 

B 

921 

1 

714 

MVKLSIVLTPRFLSHDQGQLTKELQ 

QHVKSVTCPCEYLRKVINTLADHR 

HRGTDFGGSPWLLIITVFLRSYKFAI 

SLCTSYLCVSFLKTIFPSQNGHDGST 

DVQQRARRSNRRRQEGIKIVLEDIF 

TLWRQVETKVRAKICKMKVTTKV 

NRHDKINGKRKTAKEHLRKLSMKE 

REHGEKERQVSEAEENGKLDMKEI 

HTYISPLLQESLFATGSEWRQRSIVI 

LQDCPTGPTSQLKL* 

891 

6388 

B 

922 

1 

387 

MRVRWLLFWLLFWLLLGFISHQST 

CVINTLADHRHRGTDFGGSPWLLII 

TVFLRSYKFAISLCTSYLCVSFLKTIF 

PSQNGHDGSTDVQQRARRSNCRRQ 

EGIKIVLEDIFTLWRQVETKVRAKIR 

KMK* 

892 

6389 

A 

923 

277 

489 


893 

6390 

A 

924 

465 

634 


894 

6391 

A 

925 

1 

4652 

MGSTGVYKVTPRSCHRFEQAFYTY 

DTSSPSILTLTAIRHHVLGTITTDKM 

MDVTVTIKSSIDSEPALVLGPLKSV 

QELRREQQLAEIEARRQEREKNGNE 

EGEERMTKPPVQEMVDELQGPFSY 

DFSYWARVLCFVGTGPAKLKYINY 

FRSGEKITVTPSSKELLFYPPSMEAV 

VSGESCPGKLIEIHGKAGLFLEGQIH 

PELEGVEIVISEKGASSPLITVFTDDK 

GAYSVGPLHSDLEYTVTSQKEGYV 

LTAVEGTIGDFKAYALAGVTLHSQ 

DVLMLPGDLVGLQHDAGPGALLHC 

SPAPGHPGPQAPYLSANASSWLPHL 

PAQLEGTWACPACALRLLAATEQL 

TVLLGLRPNPGLRLPGRYEVRAEVG 

NGVSRHNLSCSFDVVSPVAGLRVIY 

PAPRDGRLYVPTNGSASVLQVDSG 

ASATATARWPGGSVSARFENACPA 

LVATFVPSCPWETNDTLFSWALP 

WLGEGEHVMDVWENSASRANLS 

LRVTAEEPICGLRATPSPEARVLQG 

VPVVLLAGSSGYLVGFKFLESHGSD 

SGSANSFHRLISRNEFKTPLPDLTRV 

PRYSPVVEAGSDMVFRWTINDKQS 

LTFQNVVFNVIYQSAAVFKLSLTAS 

NHVSNVTVNYNITVERMNRMQGL 

RVSTVPAVLSPNATLALTAGVLVDS 

AVEVAFLWTFGDGEQALHQFQPPY 

NESFPVPDPSVAQVLVEHNVTHTY 

AAPAALGGGAVLTRQPSVLLHLCS 

VPHVAWEPGTLKAGPQVSTVLTVL 

ASNAFENRTQQVPVSVCASLPSVSV 

CASLTGACWYPRVLIRSGRVPIVSL 

ECVSCKAQAVYEVSRSSYVYLEGR 

CLNCSSGSKRGGYTFTLTVLGRSGE 

EEGCASIPLSPNRPPLGGSCRLFPLG 

AVHALTTKVHFECMGWHDAEDAG 

APLVYALLLQRCRQGHCEEFCVYK 

GSLSGYGAVLPPGFRPQFEVGLAW 

VQDQLGAAWALNRSLAITLPEPNG 


SAMGLTVWLHGLTASVLPGLLRQA 

DPQLVIEYSLALVTVLNEYERALDV 

AAEPKHERQRRAQIRKNITETLVSL 

RVHTVDDIQQIAAALAQCMRKLPE 

QD1AQGSYIALPLTLLVLLAGYNHD 

KLIPLLLQLTSRLQGVGALGQAASD 

NSGPEDAKRQAKKQKTRRTLATSIN 

TSREPSTDDQLPAHNQTMPQRHAR 

RSAPPRAYDRKTRQEENPHQTRSH 

AAAKRRERPPHDLQKQATTRLIPAG 

PRRRDGTSPRRTQPPPNTRRPAAAG 

HLARFRRAAPGARGARPPTARRGR 

EELDPAHIYAAAPGLPTPPRAGRTPP 

TPERRDRNTRRRRTREEGEGEFRPV 

SFLKTIFPSQNGHDGSTDVQQRARR 

SNCRRQEGIKIVLEDIFTLWRQVET 

KVRAKIRKMKVTTKVNRHDKINGK 

RKTAKEHLRKLSMKEREHGEKERQ 

VSEAEENGKLDMNX*IHFYMEMFQR 

AQALRRRAEDYYRCKITPSARKPLG 

NRVSLLVFLAFGHSLPGQDMDTFFS 

LRLCASSPAEGDGREEGCLQAFTVP 

SLLVTVLRKNTFIPTQWGPHLIF 

895 

6392 

A 

926 

3 

156 

EMFQRAQ/ALRRRAEDYYRCKITPS 
A\RKLLCNRCTYNLVLPGSEKKYYS 
HA 

896 

6393 

A 

927 

183 

1518 

ASTQSAVGLVSSVVVYPVYLA1LFL 

FWMSRSKVAGSPSPTPAGQQVLDID 

SCLDSSVLDSSFLTFSGLHAEVINTL 

ADHQHRGTDFGGSPSVLIITVSLRSY 

KFAISLCTSYLWVINTLADHRHRGT 

DFGGSPWLLI1TVFLRSYKFA1SLCT 

TYLCWSFLKTIFPSQNGHDGSTDVQ 

QRARRSNCRRQEGIKIVLAEDiFTLW 

RQVETKVRAKIRKMKVTTKAATRLT 

KIKERRKTAQDHWRKLSMKEREHG 

EKERQVSEAEENGKLDMKEIHTYXM 

EMF\QRAQALRRRAEDYYRCKITLF 

QRKPLCNRVRMAAVEHRHSSGLPY 

WPYLPAETLKNRMGHQPPPPTQQH 

SIIDNSLSLKTPSECLLTPLPPSALPS 

ADDNLKTPAECLLYPLPPSADDNLK 

TPPECLLTPLPPSAPPSADDNLKTPP 

ECVCSLPFHPQRMIISRN 

897 

6394 

A 

928 

123 

1040 

WRWFTIGTFRILLMFCCLGYEWLSG 

/GCTTWHSAWV*GSSCHPAIICFLCF 

VAKSDP*RNPGKLRKERTPRSQGQG 

SWFGEDQKSGLSILWADIVHRGT\D\ 

FGGSPWLLIITVFLRSYKFAISLCTSY 

LCVSFLKTIFPSQNGHDGSTDVQQR 

ARXRSNRRRQEGLKSICMHTKKRVS 

SFAGIKIVLEDIFTLWRQVETKVRA 

KIRKMKVTTKVNRHDKINGKKKTA 

KEHLRK\LGMKERE\HEEKERQVSE 

AEENGKLDMKEIHTYMEMFQRAQ 

ALRRRAEDY*QHDKITPSARKAFFA 

NRVQQWRQW 

898 

6395 

A 

929 

39 

525 

TKFVLGTFQILFTASFSHPSWWPLA 







LENPHDSNLSGLFPLIDLDFSP*VLS 

CWASHTMENCS*LRSKRQ1TLWCS 

RMAEVLVYCLSWKCSHLKRHDFPM 

GKYQTPTCIDKGNMLYLSKLLGIES 

QCLGAEMGIPIKAMQSFTTSGRPKN 

EHSRNFVIIWKVLI 

899 

6396 

A 

930 

1030 

1384 

LIALRKMGRNAQAQIC1IT\SDG*NPS 
PLKTESTLKTVTQFSLYPWGEKFERT 
PSLMGQKNFRTVCQLSQMGAIGFQ/ 
HIQEWDGERKST\ITKKN*KDGEISW 
LECVMNNWTCTPDSMKK 

900 

6397 

A 

931 

1 

225 


901 

6398 

A 

932 

2 

167 


902 

6399 

A 

933 

1 

3339 

PASVHPSVRPTVQRKGLQAGRTSTR 

GTEARRGAKSAADPCGPGQGTVAA 

AMQSCARAWGLRLGRGVGGGRRL 

AGGSGPCWAPRSRDSSSGGGDSAA 

AGASRLLERLLPRHDDFARRHIGPG 

DKDQREMLQTLGLAS1DELIEKTVP 

ANIRLKRPLKMEDPVCENEILATLH 

AISSKNQIWRSYIGMGYYNCSVPQT 

ILRNLLENSGWITQYTPYQPEVSQG 

RLESLLNYQTMVCDITGLDMANAS 

LLDEGTAAAEALQLCYRHNKRRKF 

LVDPR\CHPQTIAVVQTRAKYTGVL 

TELKLPCEMDFSGKDVSGVLFQYP 

DTEGKVEDFTELVERAHQSGSLAC 

CATDLLALCILRPPGEFGVDIALGSS 

QRFGVPLGYGGPHAAFFAVRESLV 

RMMPGRMVGVTRDATGK\EVY\RL 

AP*KPREQHIRRDKATSNICTAQAL 

LANMAA\MFAI\YHGSHG\LGHIA\R 

RVHNATLILSEGLKRAGHQLQHDLF 

FDTLKIQCGCSVKEVLGRAAQRQIN 

FRLFEDGTLGISLDETVNEKDLDDL 

LWIFGCESSAELVAESMGEECRGIP 

GSVFKRTSPFLTHQVFNSYHSETNIV 

RYMKKLENKDISLVHSMIPLGSCTM 

KLNSSSEJLAPITWKEFANIHPFVPLD 

QAQGYQQLFRELEKDLCELTGHDQ 

VCFQPNSGAQGEYAGLATIRAYLN 

QKGEGHRTVCLIPKSAHGTNPASAH 

MAGMKIQPVEVDKYGNIDAVHLK 

AMVDKHKENLAAIMITYPSTNGVF 

EENISDVCDLIHQHGGQVYLDGAN 

MNAQVG1CRPGDFGSDVSHLNLHK 

TFCIPHGGGGPGMGPIGVKKHLAPF 

LPNHPVISLKRNEDACPVGTVSAAP 

WGSSSILPISWAYIKMMGGKGLKQ 

ATETAILNANYMAKRLETHYRILFR 

GARGYVGHEFILDTRPFKKSANIEA 

VDVAKRLQDYGFHAPTMSWPVAG 

TLMVEPTESEDKAELDRFCDAMISI 

RQE\IADIEEG\RIDP\RVNPLKNVLH 

TPLTCVTSSHWDRPYSREVAAFPLP 

FVKPENKFWPTIA\R1DD1YGDQHL\ 

VCTCPPM\EVYESPFS\EQKRAVFLV 

LCSLSFKGIDFDGLSPEAFDKQERFH 









LPTPSLK 

903 

6400 

A 

934 

2 

287 


904 

6401 

A 

935 

36 

427 


905 

6402 

A 

936 

247 

1183 

CCWESPVDPQPLRQISGIALFCSFKE 

PPLLLGVLHPNTKLRQGRKGCLKIN 

LLGPES\MAHIGGCDVYWGQQMGR 

VVKLENG*NRRPLPGFGSGPLAKPE 

DDEPCVWGDPLGIRGRGPKWGLLF 

VGRLHTRGLF*R*NPWETVKVKLLL 

SSETPI*GGRNMSF\VNDLTVTPGW 

KEDLISPNPKQQNGQRRKLPAFWV 

MGGHKMTGRLLEV*YCDPGK*KFY 

WTQLRFPNGVQPVS*QKTLSLVAET 

S\MARIRRVYVSGPDERR\ADLFVEN 

MPGFPDNIRPSSSGGYWVGMSTIRP 

*P\GSSMLDFLSERPWD 

906 

6403 

A 

937 

179 

516 

VFSVLRAEDKICELLFCLKIKLFSAVS 

FLVFRNQLPRKNDFYSYEPPSENPPP 

ETGESVCLQLKSGAHLCRVCGCLG 

PKTCSRCHKAYYCSKEHQTLDWRL 

GHKQACAQPGG 

907 

6404 

A 

938 

41 

274 

KRGTERKTHFGGCSIQFSDIASGKNI 
LPGLGFLTHKR\WFCSL*RQGWVSR 
WSHE*GCTRCWRLGKFLWVADRFL 
GSG 

908 

6405 

A 

939 

3 

mi 

CAPRQPAPRMAAAGARPVELGFAE 

SAPAWRLRSE\QFPSKVGGRPAWLG 

AAGLPGPQALACELCGRPLSFLLQV 

YAPL\PGR\PDA\FH\RCIFLFCCREQP 

\CCAGLARFLGIRLPRKN\DFYSYEP 

PSENPPPETGESVCLQLKSGAHLCR 

VCG\C*GPKTCSRCHKAILLAAREH 

QTL\DWEIGD1RQ\ACAQPDHL\DHI 

NFQDHNFPFFQEF\EIVIETEDEIMP* 

GVWKKEDYSRDY*GALG*STLKGR 

TWISM\AKHE\SRED\KFFQKF*NFRL 

ALGTEQDS*YAGRG/1APIWISGENIP 

QEKDIPDCPCGAKKJVLEFQVMPQLL 

NYLKADRLGKSIDWGILAAFTCAES 

CSLGTGYTEEFVWKQDVTDTP 

909 

6406 

A 

941 

3458 

4042 

AGMIRRPSPWPSIRPPPAVFTNSCTS 

LQEPSGGTGRVQVPSIYQAS\STQIC 

VKGPD*GRNGKGNLSFGKAGIFHFP 

WCPKCPRPSSSPISMGLLSPEVDSVE 

R\PPTFRFPLAPIYKECV*NGAG/AQ 

APDPRQKRGWPCHWNLMGVGRMP 

RVSPHLPEAWGPKHPDDRYTKGTA 

ICPRNHLPCDPRISAIGQPQG 

910 

6407 

A 

942 

226 

401 

TSGDHWNIAVAPHENSDLLLVQGH 

DYKYRYFGLIVCVL*QAIVTPEEPQS 

IVPRLRTR 

911 

6408 

C 

943 

211 

282 

MFYPFFNPRYFSVGFIAMNRHTD* 

912 

6409 

A 

944 

1390 

1698 

HLFPHIKAGR*YGRPCREGILQ*KE* 

ETTGRHTCVLQGL\FQEWVQVRN 

VFLHEALQLVKFAMQIFEVLLEKFP 

EPIVKHDLDQNTECLFFRHMEKEHS 

SKK 



913 

6410 

A 

945 

27 

412 

IAEGNWCVYMPDIIWVFPPQAEAEE 

DCHSDTVRADDDEENESPANETDLQ 

AQLQMFNRAQWMFELAPGVSSSNL 

ENRPCRAARGSL\QKTSADT\KGKQ 

EQAKEEKLSIIDIVTNYIFFFWHMEIF 

TGHSK 

914 

6411 

A 

946 

24 

1489 

GGSSAAASGVSSRADAPVLAQSPAS 

AGNGRPSTPRVPGSRRHPSAPRSGP 

LPREDGCRTPGPQLLPLHGA\LLRPR 

T\LLSSAAEDKARSRH\PDTQHPSSG 

GRCKGGTESPSSAAGRPASMAEAEN 

EDCHSDTVRADDDEENESPAETDV 

QALIP\MIQAQWKLEPAPRVSSSNLE 

NRPCRAARGSLQKTSADTKGKQEQ 

AKEEKARELFLQAVEEEQNGALYE 

AIKFYRRAMQLVPDI\EFKITF\TRSP 

DGDGVGNSYIEDNDDDSKMADLLS 

YFQQQLTFQESVLKLCQ\PELESSQI 

HISVLPMEVLMYIFRWVGSSDLDLT 

SUEQLSLVCQRIPNILCPETPENMPV 

LALL*KFWGR\SC\IKLVSVTSPGRE 

DVF*ERP\RV\RFDGVY1SKTTYIRQG 

EQSLDGFYRA\WH\QVEYYRYIRFFP 

DGHVMMLTTPEEPQSIVPRLR\TRE 

YQGLDAIPTGVTIRLSPRHRTIRTQSI 

WLLITKEKRKEKPL 

915 

6412 

A 

947 

17 

499 

DRVLLCN/PRLECNGM1TAHCSPKPP 

GSK*SSCFSLPSSWDY\KHEPPYRAN 

LKNFFVETGSLYVAQAGFELLDSSN 

PPCFSLPKCWDYRPP*ATTPS/FKND 

SHFNFLLNRF/SHFVFVF*VLRLNLC 

NN1P*GLKVGEIQPSPKAETKLGVER 

GGKNYIRFSK 

916 

6413 

A 

948 

9 

296 

RPSHQCRLLPPRASLGLSELCPEDQ 
QSYIP*LGHHSAECR*S/TSGGSCPLS 
SVSSQASRAS/GPTSLTTAAPTSPRT 
GASALTEQYWSNRFLNHFAE 

917 

6414 

C 

949 

114 

383 

MQMVGVWGGLGGIKQDKVLLSSE 
GPRSRDGGGTWRPTLKSTVRXXXX 
XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXSALL* 

918 

6415 

A 

950 

1896 

2251 

IGTPLCRME1DPFLEEAVPWSSVSSQ 

ASRASWPTSLTTAAPTSPRTGASAL 

TEVGRPKT*DHKISSVSTK/TSSHCP 

GPEHTTSAVPVSRASSCPVTVTTKL 

SKHPLQLARRVGFTLLY 

919 

6416 

A 

951 

141 

439 


920 

6417 

A 

952 

278 

1177 

RHPLAFFKASRAGPQRPLDGTLGPE 

DSRASSPMIQNSRPSLLQPQDVGDT 

VETLMLHPVIKAFLCGSISGTCSTLL 

LQ\PLDLLKTHLQTLQPSDHGSRRV 

GMLAVLFKGVRTERLLDLWKGMS 

PSIVR\VSLGVGIYFGHSLLF*SSISW 

RRPIPQTALE\ANHAGGSGSRSVAG 

VCMSPITVIKTRYESGKYGYXKSIYA 

ALRSIYHSEGHRGLFCGLTAT1LRD 

APLSGIYLMFYNQTKNIVPHDQVDA 

TLIP1TNFSCGDICWYSGPHWVTSTC 









GMFIQNSYGRLLFHLKVSMDWPST 

921 

6418 

A 

953 

109 

376 

SLTRAKSHLGQQFPTSRTSQEENWS 
P*RGSRTPPTPHTLPCQNLRPSMPT/F 
*QVNFKQFRGAFLSG*KSSVAIRTIQ 
QSNMALMGTYAL 

922 

6419 

C 

955 

123 

329 

MISRAPLPQLSELHCDKLHVDPENX 
KLLGNVLVTVLAIHFGKEFTPEVQA 
SWQKMVTAVASALSSRYH* 

923 

6420 

A 

956 

41 

565 

APSPRRPWGHFTEED\KATI\TSLWG 

K\VNVE\DAGG/EKTPGKGPLVVLPP 

WTPEVPLTSFGNLS\SASAHHGQTP 

KVQGTMAKKVLTSLG\DA\TKHL\D 

DSQGAPFAQA*SELALVDKPAMWD 

PEVNFKASWGNVNLVTRFGQSHFRA 

KNFTPEGCRVSLGRKMGDLELASA 

LVPSRYH 

924 

6421 

A 

957 

1 

1000 

STRAPSPGPFPSSKLAGAYKSWCRR 

DPRTHSAGAQAAAARSVPIRCPAPT 

ASATMSHHWGYGKHNGPEHWHK 

DFPIAKGERQSPVDIDTHTAKYDPS 

LKPLSVSYDQATSLRILNMGHAFVN 

VEFDDS\QDKAVLKGGPL\DG\TYRL 

ISVFTFHWGSF*WDKVSEAYCGIKK 

KYAAEVLTLGHWNTKYGDFGKAVQ 

EPDGLAVLGIFLKVGSAKPGLQKVV 

DVLDSIKTKGKSADFTNFDPRGLLP 

ESL\DYWTYPGSLTTPPLLECVTW\I 

VLNF\PFSVSS\EQVFEIP*TLTFNGG 

GVNPEELMVDNWRPA\QPLKNR\QI 

KASFQIRWSHSLYSK 

925 

6422 

A 

958 

3 

402 

EELTMAGIFV*PTIPIPVSL/SLFCH*V 

LTLNSGISPAGSPVLIFSTPEPKR*TS 

QGESRFHTFYLKKLGLNR*I*HPSSS 

SSSSSSSSSSSSSSSSSSSSSSSSSSSSQ 

NRFLKPLQHSLPPPLKPLTYAPNL 

926 

6423 

A 

959 

1666 

2187 

NFPSSASPPPTDSFIAGLSSEAPSEHR 

SPSCALDPIFFQTWL**SFSFSSLNFI 

NMLKFVPPLNKTKPPLTLNFPYLKQL 

ASLPIQSCFFF*DKILLCHLGWSAVA 

QL*LTATSTSWAQVMFPRSWAYRH 

APPHT/LASCFYFCRDR/SLTIFPRLV 

SNSWAQVILPPRPPKMLGIQA 

927 

6424 

A 

960 

3 

695 

TQLLRRPAVFVGSAASGIRRGLWSA 

SSGHWCAPAAGRAHAPVPRLVRGL 

GAASTAAPQDAQTGPQPMPRADCI 

MRHLPYFCRGQVVRGFGRGLQASL 

GIPATANFPEQWDNLPADISTGIYY 

GWASVGS\GDVHKMVMSIGWNPY 

YKNTKKSMETHIMHTFKEDFYGEIL 

NGAIGDYLRPDDNFDSLQSLISAIQG 

DFVEEAKK*LDLPEHLKLKEDNFFQ 

VSKSKIMNGH 

928 

6425 

A 

961 

60 

569 

STDLEELPTLGWF*KQELIILSCPFVS 

LTYRERLPANFFKFQFRNVEYSSGR 

NKTFLCYWEAQGKGGQVQASRG 

YLEDEHAAAHAEEAFFNTILPAFDP 

ALRYNVTWYVSSSPCAACADRIIKT 

LSKTKNLRLLBLVG\RLFMWEEPEIQ 









AALKKLKEAGCKLRIMNLV 

929 

6426 

A 

962 

62 

858 

QLRWDSGARAWPRPACLSPLPQRL 

LSHSPSMAQKEEAAVATEAASQNG 

EDLENL\DDPQKLKELIELPPFEIVTG 

ERLPANFFKFQFRNVEYSSXGRTKTL 

\LCYVV*STGARGGKVQASWG\YLE 

DEVHAACPLQKESFSFNTILPAFRPK 

PLAVTNVTAVGYVSSSPC\AACADR\ 

IVKTLSKTKNLRLLILVGRLFMWEE 

PEIQAALKKLKEA\GCKL\RIMKPQD 

FRILSWE\NFVEQ\EEGESKAFQPWE 

DIQENFLYYEEKLADILK 

930 

6427 

A 

963 

409 

747 

VILQAQGSPMGP*SLRAFPAESERC 

QKQERLEPEEGRTCAAGLRGGPR\R 

WWPLSSW\TGDLRPSPARN*ILPAA 

PMMEERKDPAPAQPPWTSNTLPQFV 

SPEVLCSPPIENSHT 

931 

6428 

A 

964 

1092 

2338 

RCYCSPPCFHLFQLSFQILDPPVLGT 
TFL 


932 

6429 

A 

965 

146 

180 


933 

6430 

A 

966 

2 

921 


934 

6431 

A 

967 

1 

621 


935 

6432 

A 

968 

2 

152 


936 

6433 

A 

969 

157 

1203 

NNSGVMPEMPEDMEQEEVNIP**G 

GFWVTGCHWGFLGRAVHKEFQQN 

NL/WHAVGCGFRRARPKFEQVNLL\ 

DSNAVHHIIHDFQPHVIVHCAAERR 

PDVV\ENQPDAASQLNV\DASGVNLG 

KGKAAAVWEHFSILHLGSGF-CILM 

GTVNPPY\REEDIPASLN\LYGQTK\L 

DGRKGCPWRNHLGAAVLRIPILYG 

EVEK\LEKKCCELLMFE*KCQFQQQ 

SQAKQWIHWQARGSPPHMSKDVA 

PLCARQ\LAEKRMLDP\SIKGTFHWS 

GNEQMTKYEKGMCQLPDANFNLPS 

SHURDPITDSPVXLGAQRPRNAQLD 

CSKLETLGIGQRTPFRIG1KESLWPF 

LIDKRWRQTWFH 

937 

6434 

A 

970 

1 

508 

NSNRQNGPPKKGERERASN/C/YPG 

APAAQAE/APLVPLSRQNKSTVETS 

NLKMLISFPKTLLRGPQEGWWHQG 

INPGSGAATLGPGS/SPQRPQS/IAAS 

CSMARRTFFAVSSNSFFLLL/CFLCM 

GSSSGSQPSSSLKQKKHWAKSGSFS 

VGQWMKPASAIRSGVQRSPPRRAS 

S 

938 

6435 

A 

971 

21 

351 

VVSITKAPAYREVSVHNSCLRSNEG 

GKQPSHTKCLCNSNLLTTQFKTKPI 

E/CWPEKTYMGSSSGSQPSSSLKQK 

KHWAKSGSFSVGQWMKPASAIRSG 

VQVRSPPRRASS 

939 

6436 

A 

972 

1 

1011 


940 

6437 

A 

973 

2 

94 


941 

6438 


A 

974 

661 

2244 

QYFKNPVGSTAVFEMDRLF1SSGTA 
EMTSRGF\QRS\CNNPP\CSSMTGRR 
ANQIHHLTPDFS\LRELL\PPK\KAGT 
WADCVSPPCGERDRCEGWADRHR 









A\CSSPAKSPTASSQVI*KGFKATAF 

FSRGRSGQHVAPRLHLGLSNHTAV 

GKREYLARRFLSLYSVSSSTRSFTPF 

PQ*LRMAFVLSF*IWCAIL*INNSN 

KIRKPCC*TVEEPGTLSPFSHHTQV 

MHLPKKKKTKKHNNKKTVL*GGT 

GVTDKEPAAQIQQAPGKVPPQDCD 

SLNNMRSRHIHCGRLCNHANKAVSS 

SKRDTAFFLPHFSPGKPGNQNSKNE 

PPKKRERERSSHC/YPAAPAAQAEA 

PLVPLSRQNRSTVETSNLKMLISFPK 

TLLRGPQEGWWHQGINPGSGAATL 

GPGSSGQRPQSM\AASCSMARRTFF 

AVSSNSFFLLLVSFAILFLALSIASSF 

KKFNQRVNSSXNCFLD/TERKAQPG 

RNCFNLCSSMGSSSGSQPSSSLKQKK 

HWAKSGSFSVGQWMKPASAIRLRG 

CRRSPPRRASS 

942 

6439 

C 

975 

597 

683 

MWFHVCLLVVIFFYFLVHMKYLKC 
KFLG* 

943 

6440 

A 

976 

224 

290 

MPIPIPMPIPAPVPFPGPGFPP/VYFPV 
PLPLP/LLFPF*PLPFP*PEVSAKPVTL 
WSRKRQRSKGEKEGRGRGTGK 

944 

6441 

A 

977 

3 

1815 

HFVPVSPEAAAAESTACGVTAKML 
SVRVAAVVVRALPRRAG\LVSRNA 
LGSSFIAARNFHASNTHLQKTGTAE 
MSSILE\ERILG\ADTSVDLE*NLGVS 
*SIGDGIAPRTWG*RNVQAEEMV\EF 
SSRLKGYVL*TLEPDNVGVVVFGN
DKLIKEGDIVKNRTGAIVDVPWGEE 
LLGRVVDALGVNAIDGKGPIGFQRR 
VGEVGL\KAPGIIPR\ISVREPMQTGI 
KAVDSLVPIGRGQRELIIGDRQTGK 
TSIAIDTIINQKRFNDGSDEKKKLYC 
IYVAIGQKRSTVAQLWKRLTRCKM 
PWKYTIVVS\AT\ASDAAPLQYLAPY 
SGCSMGEYFR\DNGKHA\LIIYDDLI 
QNKAVA*PVKMSLLLRRPPGREAY 
PGDVFYLHSRLLERAAKMNDAFGG 
GSLTALQV\IET\QAGDVSATIQTDV 
NSI/TLPEQIFLETEW\FNKG\IRPAIN 
VG\LSVSRAGSPPTNPGAMKPGSQV 
PWKLELASSIREVACFLPPVSVSD\L 
DAATSTNLLESVAVPS*LEFAESQG 
PVFSPWLIEGTSCLLSYAGCKGDIFD 
KLEPSKITKLRNAFLSHWSQHQDL 
VGALCRADGKISEQS\DAKLKEIVT\ 
NFLAGFEA 

945 

6442 

A 

978 

532 

878 

SYHFGRPRQADHLRSGVQDQPGQH 
EETPSLLKRQK\LAGHGGIYL*PQLL 
GWLRQENCLNLGGGGCSELGLHHC 
IPAWATEQDCLKKQNETKKESYS** 
GTSCLIAFLFILKSDQK 

946 

6443 

C 

979 

36 

236 

MGPTIPDXSXFFWRKPITWMPTWE 

/^TrOVTX //^TJ^DT CCCT/OT T TPVH /^t Tn A m 

u 1 oN VOryJrJ-rbbbKSLHSXRGHPAPI 
PTGQAGPRDSGPGASP* 

947 

6444 

C 

980 

26 

160 

MRFQSTGLGAPHCALNKCVSCLNX 
XXXXXXFLLRGPKLNPFKGG* 




948 


6445 


981 


1 


523 


FFPKGFFLGYPRVFPFPPFLNPGPGQ 

I*FSPP*K/RKLRAP*KKASSSSSSVK 

TGNTFIESTVGCAKPCALEPHHSAP/ 

PQQPAPSGPGPPGEPG*ERLCASHK 

AFISHKQSH*SPQ*PCQAGVTLSRLQ 

TTNSPRP\PVRRGCGGLEPRP*VS/PS 

QPTACSENSQGSQPSPKRTLS 


949 


6446 


982 


145 


1315 


CLPPPGLGPAPSLSSSCWGPPPMPRT 

IRGIPPGSICRLCFCPAAGLLSLVAL 

QRPASSPTRHSALLCLLRWA*PTSPS 

LPQSLGSRAQLHSPLP*QAGSPCHA 

HTCGHLGPPVV*VPLPCPRSHSHSS 

CLFPAGSQCP*QAPAGVSISCQASSS 

ASSCGPGPQGYGQTPGP1PETPRDR\ 

PCHPTPPKSKLQARGPWRAWVG*R 

TADGSCGKKP*/CGPTIPDPKSSPLA 

GLSSPFPWFWAETHHPGMP*RGKR 

DPRKCGNPQPPSPKLRKSPPTAHGD 

HPAP*FPTGKGWTPKDSWTPGPPP* 

SRRPRPLN*WTAA*PWGQNPWAWT 

PAHPRKKP*RPRGSCLSLSVSACGK 

WAPSPTSQGCCEGRCDAVPKQQGL 

AHPTVLLNKC VSCLN 


950 


6447 


983 


682 


PPLFFQAAGKMADIQTERAYQKXQP 

TI\FQNQGRGSCCGETGK\EKLPR\Y 

YKEHSGLGLQRHPKEAY*GPPTLTK 

KCPF\TGKCVHFEGGILSG\WTK\M 

KMQRTIVI\RR\DYLH\YIR\KYNRLP 

RKRHKNMSV\HLSPCFQGTSRFG\DI 

VTGGAKCRPSEARQWRFNVLKGH 

QGLARHQRSSFQKFLRLGHRGRSPT 

MEIRLVPASPGKKKKKSFVPTGGAV 

DSIGGRGV 


951 


6448 


984 


465 


952 


6449 


985 


1585 


2239 


953 


6450 


986 


2580 


SLPPKKCELRLLHNCWGLFSPPARQ 

SGSVAAVVAASAVSGVSGPQSPPLT 

CASSSLRSPRPARTRPVAVCVSPTPP 

RLPPRSSLRADMSGDHLHNDSQIEA 

DF\RLMDSLlCHKD\KQKDENARAR 

GHKEEKDRVEESKHSNSEHKDSEKK 

HKEKEKTKHKDGSSEKHKDKHKD 

RDKEKRKEEKVRASGDAKIKKEKE 

NGFSSPPQIKDET*DDG\YFVPPKEDI 

KPLKRPRDEDDADNK/PHKKIKTED 

TKKEKKRKLEEEEDGKLKKPKNKD 

KDKKVPEPDNKKKKPKKEEEQKW 

KWWEEERYPEGIKWKFLEHKGPVF 

APPYEPLPENVKFYYDGKVMKLSP 

KAEEVATFFAKMLDHEYTTKEIFRK 

NFFKDWRKEMTNEEKNI1TNLSKCD 

FTQMSQYFKAQTEARKQMSKEEKL 

KIKEENEKLLKEYGFC1MDNHKERI 

ANFKIEPPGLFRGRGNHPKMGMLK 

RRIMPEDIIINCSKDAKVPSPPPGHK 

WKEVRHDNKVTWLVSWTENIQGSI 

KYIMLNPSSRIKGEKDWQKYETAR 

RLKKCVDKIRNQYREDWKSKEMK 









VRQRAVALYFIDKLALRAGNEKEE 

GETADTVGCCSLRVEHINLHPELDG 

QEYVVEFDFLGKDSIRYYNKVPVE 

KRVFKNLQLFMENKQPEDDLFDRL 

NTGILNKHLQDLMEGLTAKVNFRYT 

YNASITLQQQLKELTAPDENIPAKIL 

SYNRANRAVAILCNHQRAPPKTFEK 

SMMNLQTAKIDAKKEQLADARRDL 

KSAKADAKVMKDAKTVKKVVESK 

KKAVQRLEEQLMKLEVQATDREEN 

KQIALGTSKLNYLDPRITVAWCKK 

WGVPIEKIYNKTQRE\KFAWGHLT 

WLDEDYEF 

954 

6451 

C 

987 

65 

235 

MQFRVKYHICSTLLSLKTKICITCIIP 
SHLFPASTIPSWGCFHLYIHIAQKHV 
ING* 

955

6452 

A 

988 

16 

148 

SPAEQGCVCVCVCVCVCVCVCVCV 
CVCVCVCVCVCVCVCVCVCVQVG 
ACVCN/CVCVCVCVCVCVCVCVCV 
CVCVCVCVCRWVLACATCVL 

956 

6453 

A 

989 

287 

504 

LPRNFKTIYLDSEMVLESSKRGVCV 
SVCVCVCVCVCLCVCVCVCVCVCV 
/C*YLDLNHGKCTHPVSFSVRIFLA 

957 


6454 

C 

991 

201 

488 

MGSRPRFCLFTTNTLCPDTVTSSVC 
SPKTTXRRLKXTFMPRCSRKPQAVL 
TSSEMALAACSXFSRSPDDFTQYQV 
AELVWDSLQPLGQXRSHCSLR* 

958 

6455 

B 

992 

53 

302 

MTSALTQGLERIPDQLGYLVLSEGA 
GLASSGDLENDEHAASAMSELVST 
ACGLRLHRGMNVHFKRLSWFGEH 
TLLETRVLTEX* 

959


6456 

B 

993 

277 

573 

TSALTQGLERIPDQLGYLSSGDLEN 
DEQAASAISELVSTACGFRLHRGMN 
VPFKRLSVVFGEHTLLVTVSGQRVF 
V* 

960 

6457 

A 

994 

134 

1271 

NPGPVQVGVEGGQEEGPSSKKQAK 

TRQVCLASITEAPGPKIRFSEPLRPP 

AGCRHQMGSPTSGSASSPQTPFCPG/ 

PPSPAVCV/PPKTTGGETHQTGA*RA 

HSMPRCSRKTAGCAEQLQRWHWL 

PAHHSPGPQMTPALHLHSVPGSRA 

GLGFAPAPGSAQKSSG*RCKS*EAC 

*RDGRPDTLHLQTQVSGLT\WPQVF 

SFPSQVPSRPPPPYMVLNTDLPEPPS 

APTLAPRLPWPSTSHLCYPKGPVLP 

LWPLPSDP\SSPPFVSARPA/ALPAAP 

EHPPTDPSPAFSSPSLPFPSPLPPRAD 

RR*GWSAGPPGG/EPHRLGSRDAEP 

PAGPLAHASSLTIAVFGAGGAPYQI 

GSFRLQAPVTCLQPLRSSFCLRHWP 

LAPPLA 

961 

6458 

A 

995 

1 

422 


962 

6459 

A 

996 

3 

760 

TSRGRVGTQAGEPRDLRPPPCPSSPL 

RV\AVV\CLEQPERGAWEAHNIPQP 

NGDSAVRSFG\TGTHVKLPGPAPD\ 

NPNVYXDFKTTYDQMYNDLLRKDK 

\ELFTQNGILHIAG\RNKRJKP\GPERF 









QNCKDLFDLILTVCEERVYDRVGWK 
I*ISR\EQGDLPSPVHWN\LDIQDNH 
EEATLG/ARFLICE\VCQCIQHTEDM\ 
ENEIDELLQEFEEKSGRTFL\HTV\CF 
ILFSFLDICF 

963 

6460 

A 

997 

425 

591 

EYLKQWNVFVFDIKNHLC/MHLHA 
HIHTQV*THTQTHTHRHTHTHTHTP 
CHCLVHTA 

964 

6461 

C 

998 

120 

368 

MGTAHKPGYPQISTTVCIYVPDFAIL 
SPVLSFCRLIYLHTRMLQATELLLQ 
ALQN STTKGNGRRERARDN SKMRH 
DGRAWRC* 

965 

6462 

A 

999 

2 

393 

ARGKKPNLRAFLLPLPRAWPEPQTL 

QQPRWP*TVQLPVPAIGDKALALGR 

RKSGPGPGFVVLRTVPHALSHVRSL 

FRSLLLPLSLTLCVSFSFCLSVFPSPL 

PCSDGANPWQSQSWKQEDGSSSPW 

EVSA 

966 

6463 

C 

1000 

1247 

1716 

MLRPRLRCFFSSSQTACSSDLNRSV 

GSSQARPQGSPSSFSASCRKCSSVLH 

KEGPQLLWGQVPSFYGPSTCPFHT 

AAADSAWPCRSRASFKVLSHDFHR 

PLVLLAAQRLPPARFPLGRLGARSH 

TAGGAERAGVGAAQQLQRRPRWP 

GRRARAPRP* 

967 

6464 

A 

1001 

3 

630 

FCPRGQEFGEGNKLLSPRRPWGHFT 

EEDQGLLSTSLLGQG*ILEDAGRKK 

PLGKAPLVV\YPMGPQGFL*TGFGQ 

PCPSCPLPIMGQPPKVQGHMAKKW 

LTSLGEMPIKHNLDDPQGHLCPSLSE 

LHCDK\LHVDPENFKLLGKCAGD\V 

FGNPFRQRIHPWRLQASWQKMAED 

GDCKWPVPCPPDTTEASWPMNSEA 

FKDKAFILASNYK 

968 

6465 

A 

1002 

41 

625 

APSPDAHGV1SQRRTKATI\TSLLGQ 
G*ILEDAGGRKPLGKAPLWLPKW 
DPKRSFEQALGNPVPLPSAMMGQPP 
KVQGHMAKEGA *PSLGEMPIKH\LD 
DPQRAPFAPA* SELH\CDKL\H VDPE 
NFKLLG\NVLVTV\LAIHFGKEFTP/E 
RLQASWAEDG\D\*SGQCPVLQIPLK 
PLGP* IQKLSKDKAFILASNYK 

969 

6466 

A 

1003 

106 

1315 

KQSGRAPGKVVSRAFPGLNPCPGW 

K\LLTQ\VGAQVLGRLGDGLGAALG 

PGNRTHIWLFVRGLHGKSGTWWDE 

HLSEENVPFMKQLVSDEDKAQLASK 

\LWPLKDEPWPI\HPWETGSFKV\GL 

FDLKAGHVGLLWTKDGQKHVVTL 

LQVQDVCHVLKYTSKENCNGKIGNP 

VC*EGKTVSRFRKATSILEFY\RELG 

LPPK\QTVK\IFNITDNAA1KPGTPLY 

AASLFVQGQYVDVTAKTIXGKGRQ 

SCSLKRWGVFKGQPAYRIGQTENPT 

GRPGAWATxGDIGRVWPGTKMPGK 

MGKHIHGQNMGLKWWRINTKPPm 

YVNGSVTWDIKNCLVKVK\DSKLP\ 

AYKDLGKNLPFPYIIFLMGDGRGNL 

PERFCID*KPCCQPRWRPPINICPNIL 









WDVGRTLTIFL 

970 

6467 

A 

1004 

218 

530 

KFEGCLPPRDETRIPIWARRCAYVV 
* SKE VSL*HCNTVTPG\GKPNKTVRVI 
WGK\VT\RAHGKPVAMVRAQIPEG 
NLSCLRPFGHRURSDGCYPSRILNLT 
EKSN 

971 

6468 

A 

1005 

2 

269 

FESEDEGEEFDDWEDDYDYPEEEQ 
LSGAGYRVADKMFLRTREPALDGGF 
QMHYEKTPFDQLAFIEELFSLMWN 
RLTEELGCDEIIDRE 

972 

6469 

A 

1006 

1 

456 

AELSELYEESSDLQMDVMPGEGDL 

PKWNEEATGTRAAIPWVPPATGAQQ 

LEEEGPMEE\EEAQPMAAPEGKRSL 

ANGPNAGEQPGQPSPGRRTSRAEDE 

A\EEFDDWEDDYDF\PREEPVKGAR 

LRFLPPS*KTPPSFWENRNTPLWGG 

LKIFY 

973 

6470 

A 

1007 

563 

771 

WSMVSLCSTAAVAPVCSRIPERTRR 
ATASVTHFCGDLG*SPVKPLSLGYR 
SNLGGPEEGREGGRKARRK 

974 

6471 

A 

1008 

300 

391 

AVCTMSEMAELSELYEESSDLQMD 

VMPGEGDLPQMEVGSGSRELSLRPS 

RSGAQQLEEEGPMEEEEAQPMAAP 

EGKRSLANGPNAGEQPGQWAGAD 

F\ESEDEGEEFDD\WEDDY\DYPRRR 

SQLQWCPVTRVSAALEEA\DKMFL\ 

RTREPALDGRVSRCHWEKTPFGSV 

SLLSEELFFT*LVCQSV*PEEL\GCDA 

WRG 

975 

6472 

A 

1011 

100 

270 

LRSSAVTVLVSLIHSPSSFAHHPSVD 
T* PHCLESPPGFKAIFIRGRLFTEACF 
CRIA 

976 

6473 

A 

1012 

13 

670 

RQRPKARASIPLHQPPQEACQPPAA 

ALTRPQPRP/PSALSHPAKPHSVSSA 

GSSYKNNPFASSISKHGVSSGSSSSG 

GTPVQSSVSGSLVPGIQPPSVGQATS 

RPVPSSAGKKMPGFPEVDSGSPSRR 

SK\GDSSGGTQGVAKLLTSPSLKPSA 

VSSVT\RLPPCQKERVGLCCWPAPL 

*WLHPTNPAAQSCLGP*ARTPRGLG 

AAGVSLAQRKSLSTYRA 

977 

6474 

A 


1013 

3 

578 

GIPWWTHASEAVQTEIPVVGPREW 

QSC*PRR/RLKPSAVSSVTSSTSLSK 

GASGTVLLAGSSLMASPYKSSSPKL 

SGAMSSNSLGIITPVPIPV\QCSPSAL 

TPLPKQGSPRMPSSQALPPGPST\RP 

WPQSSGWLALQPAPCSASPTRCGA 

HPYPAESAR*SPDGQCAHAHRVPLP 

SPPLGALPLPRVLVSLP 

978 

6475 

C 

1014 

426 

653 

MVTWGGGSHQRERKEGPGTRVFM 
GREALESPCSASHCRPLLGFELSNT 
NLLLWLFCYYLRLLCKQTGNPSCK 
KYI* 

979 

6476 

A 

1015 

1286 

2318 

RTVPFYPMHLMVMMKTEPKIGVCK 
NPIIIVESTKVFSLKELHCHVPREKL 
APTVST/PLGVLSISQGQCRSPSCGS/ 
GPEFCPL*AHSLGMGRHWDHPGSL 









LQPEGAPEQWGALRYSDQMPGDQ 

ADSPTGVPTALPPEPVNTVPPTTAQ 

RKPQPRAAPLTTV*RQSCKAQSTGQ 

SAPPEQQGPGSLHGRHIRSAEKRSA 

ENSRSSRRTPSSRKGAVSVHPGRPA 

RSDS\PRPKWQALPLKRSQFPWYSR 

PFPVKLRSTWKRVRKIRTQVFHAL* 

FP*GSSNQTSRETPK*QKSPNAECSR 

AQTSHETPASSYSCTLLLATACKHTI 

TVKVGGHSAQCFQNSNSAVQL 

980 

6477 

A 

1016 

315 

480 


981 

6478 

A 

1017 

1 

1845 


982 

6479 

A 

1018 

1 

447 


983 

6480 

A 

1019 

218 

544 

SGFSLSLRGSISFSL/CFKVGMCVTY 

PRCYC*S\PVP*KPLIKPGWVSELPKP 

MNWASCSRTLASFLLQAARSHPWT 

PMKMETMEQSKLRSQPCKPAAGAV 

DPRAWHGT 

984 

6481 

A 

1020 

198 

562 

LKCGKQWSDPITSPQTESQLLGSSQ 
QQLHQQRHLQGP*K*NRDSPLCFSL 
SSCPIPKTYTNRHILLP/SSSKSLCKF 
L*PA*ISPQKMGFSFLSQSVCKFSKL 
LCSASLIKLKAFNSIQVTS

985 

6482 

A 

1021 

72 

270 


986 

6483 

A 

1022 

545 

812 


987 

6484 

A 

1023 

477 

750 


988 

6485 

B 

1024 

537 

867 

XEQAAPCSALSLPMASCLRGHGEA 
RADPWSSTRP1DLKFKGPFTDVVTT 
NLKLRNPSDRKVCFKVKTTVPHRY 
CVRPQQWNLLTPGSTVTVSVMLQP 
FDYDPNEKSKHKFM* 

989 

6486 

A 

1025 

822 

1750 

SSAEPSPPSSPAPSQQTAAAGAPPLC 

AVSPMASASGAMAKHEQILVLDPP 

TDLKFKGDG*VFIRPEQYYTV*KWC 

KRSKRHGPFRP\FTDVVTTNLKLRN 

PSDRKVCFKVKTTAPRRYCVRPNS 

GIIDPGSTVTVSVMLQPFDYDPNEK 

SKHKFMVQTIFAPPNTSDMEAW/W 

KEAKPDELMDSKLRCWFEMPNEN 

DKLNDMEPSKAVPLNASKQD\GPM 

PKPHSVSLVNDTETRKLMEECKRLQ 

GNEMMKLSEENRHLRDEGLRLRKV 

AHSDKPGSTSTASFRDNVTSPLPSLL 

VVIAA1FIGFFLGKFIL 

990 

6487 

A 

1026 

184 

282 

VIASQNIFVLSSVTGPKDRSGRQPLV 
FLKSPG*THPS*SVSRNLFS\FA*PGD 
FRKTKGCRPDLSFGPVTLLRTKIFW 
LAIT*D*CFLGYIKMGHIVEHCQQ 

991 

6488 

A 

1027 

445 

992 

HCCGRNCLQRRWGWKRVRSLLAGI 

VFVSPFFKLELQKPLPSEQITIGLML 

LPFPHFFFCCFVFCFLFVCLFF*DRV 

MLCHPGWSAWRSQLTVTSASRVQ 

A1ICLSLPSSWDYRHPPPCLAFSR/DR 

GFTILARLVLELMTS*ATTPSLFCCFI 

ADSVQQYAPSLYILRNTNPRLLLAK 

IFVA 

992 

6489 

A 

1028 

33 

476 

HEDHAGPEPPRSYIPPYNATVVQKL 









LDQGALLMGKTNLDEFAMGSGSTD 

GVFGPVKNPWSYSKQYREKRKQNP 

HSENEDSDWLITGGSSRGSAAAVSA 

FTCYAALGSDTGGSTKNPAAHCGL 

V/GFKPSYGLVSRHGLIPLVNSMDV 

PG 

993 

6490 

A 

1031 

187 

1611 

RAWERQERQRSGCSWLTRGVRGG 

GEGRTTRRKMASKEMFEDTVEERV 

INEEYKIWKKNTPFLYDLVMTHAL 

QWPSLTVQW/LSLKVTKPEGKNDMP 

LHW\LGLGTHTSD*SRIILVVARVHI 

PNDDSQFDA\SHCDSDKGEFGGFGS 

VTGKIECEIKIN\HEGEV\NRAR\YM\ 

PQNPSHPLLTKTPSCLMVLVFDYTK 

HPAKPDPSGECNPDURLRGHHKEG 

YGLSWNSNLSGHLLSASDDHTWCL 

WD1NAGPKEGKIVDAKAIFTGHSAV 

VEDVAW\HLLHESLFG\SVADDQKL 

MDM/WDTRSQYHLPRPSSLGWDA\ 

HTA\EVNCLSFNPYSEF1LATGSA\D 

KTV\ALWDLRNLRLNLHTFESHKD\

EIFQ\VHWSPHNETIFGFKCTCRR\LN 

VWDLSKIGKEQSPEDAENGPPEI\LL 

IHEGHXQVKISDFSWDPPMKPWVICS 

WSEDNMMQIWQNGLKIFYNDERVR 

CHDHPKLEGKGS 

994 

6491 

A 

1032 

3 

551 

FLLAPVEVSEGSFAEIWGQITGVGLF 

LCLGESPACWERGLSKRDLMSVKA 

CGPKAHFCLGYKAGGLPGTQRGAT 

QALL*KFEGVYARRMKPDSNLG\RR 

CAYVV*SKEVSL*HCNTVTPG\GKP 

NKTvRVIWGKSKLGAHGKQWAWF 

VPKFPKAIFPA*RPFGHR\IRSDGCYP 

SR1LNLTEKSN 

995 

6492 

A 

1034 

20 

867 

ALERRVRKSGDCCTDSGTMNIFDR 

KMTFDALLKFSHITPSTQQA/HMKK 

VYASFA\LCYFGAA\AGAYVNMVT 

HF\IQAGL\LSALG\SL1LMIWLMATP 

HS\HETEQKRLGTSLLGFCIPYRKLG 

LGPALGSFVIAVKRQASLPTAFMGH 

SNGSFPAFTLSALLC/RRPRSYLFLG\ 

GIL\MSALSLVAFCLPLGVNVFFWIPF 

WVFQA\NLYVG\LVVMCG\FVLFDT 

QLH\EKAEQGDQDYNL/WHC\IDLFL 

DFMTVFQKNSMKDPGP*MKKDKKK 

RRRNEVTIQPFPI 

996 

6493 

A 

1035 

153 

546 

PAQETGRPRSKAHVASTWRAFPPE 

DQVVLLAGAP/LWEDEAHFWAKCG 

VEAL\TTLEVTRPACLEGK\VHGSL\ 

ARAWKSDEGQTPKWAKQGERRKK 

KTGSG*/RRPDSSYNRRFCQTLLPT\F 

GKKEGPQWPTS 

997 

6494 

A 

1038 

1 

433 


998 

6495 

A 

1039 

101 

1898 

SAAMIGGLF1YNHKGEVLISRVYRD 
DIGRNAWDAFRVNVIHARQQVRSP 
VHQHLLRTSFFHV/IRRSNIWLGSSS 
PRQNA^STVAMVFEFLYKMCDVMA 
AYFGKIS\EENI\KNNFWHYMELLD 









EILEFGYPQEFRD/SGALKTFMTQQG 

IQGVQAS/AQKEGSSQFTSQVTGQIG 

WRREGIKYRRNELFLDVLESVNLL 

MSPQGQVLSAHVSGP/VWVMKNYL 

SGMPECKFGMNDK.IVIEKQGK.GTA/ 

G*NQARAGKQS1AIDDLHLSTQCVR 

LSNKFDSERSISFIPPDGEFELMRYRT 

TKDHLPFRVIPLVREVGRTKLEVKV 

VIKSNFKPSL\LG\QKIEVRIQPPLNT\ 

SGVQVICMKGKAKYK\AGENSFVW 

KJK\RMAGM\KNTHIISE1GFLPNKEK 

KKGGCPPLFPRNFGFKFAPSGLKVG 

\YLK\VLNPKLNYSDHDVIKWVR*\I 

GRVSGIYENSACKATRQLAQLPQPPF 

LQQVQVPLLPQTTHQVSPSPPCFAC 

PSPLQPSPEVLGLGQSNITKWDRWK 

QPLGSPGQGSSEGSCSPHPPVCSWP 

NAQALSSVTQSPGGFPFLPHPCGHS 

SGVGGLVAPHLRAPPKASEWIPGLS 

PYSALG 

999 

6496 

A 

1040 

255 

662 

TGEGYAGTEATDITHPQLRNQQWV 

PRCKPFPICDLKIQPERNYLFFLR/QR 

VSLCHPGWSAVV*SRLTATSAFPGS 

SYSPAFSLPSSWDYRCAPPRPANFCI 

FSRD/RGFTMLARMVLIS*P\VIHPP 

WP\P\KVLGLQA 

1000 

6497 

A 

1041 

2

297 

TLILPQHVNCPGGINAWNTITSYIDN 
QICQGQKNLCNNTGDPEMCPENGS 
CVPDGPGL/VLRDSGSHHSIRLHSA/ 
LGDPAPKSQDFMNYIGLTIDLRSI 

1001 

6498 

B 

1042 

1 

786 

MAPHDPGSLTTLVPWAAALLLALG 

VERALALPEICTQCPGSVQNLSKVA 

FYCKTTRELMLHARCCLNQKGTIL 

GILPQHVNCPGG1NAWNTITSYIDN 

QICQGQKNLCNNTGDPEMCPENGS 

CVPDGPGLLQCVCADGFHGYKCM 

RQGCLSAAAPQALAGKWPPKNCHL 

PSFVDGQPQGQKEPCNNYPSIYTFV 

PVCGQICGIKMLKTELLHLKYWDIG 

PGNRNSYKFAAGNVKFAVTLENSL 

LIPQKAKRNYHMTQQFRS* 

1002 

6499 

A 

1043 

137 

1021 

GRAEAGSLASQCVALASGSPVLLG 

GPAVLISSLTLPDPAQPDMHPMSRE 

RAKFVKSGL\YCKTDTKS*CLHA\RC 

CLNQKGT/1LWGLDLQN\CSL\EDPG 

QNFSIRHITTVII\DLQANPLTGDLAN 

TFARGFTQLQTLILPQHVNXCPGGINA 

W\NTITSYIDNQICQG\QKNLCNN\T 

GDPRKCVPEKGILCYLNGPKVFWQ 

C\VCA\DGF\HGIQSVLPPGLVPHCL 

MFLREFWEPTHSIRLAHSVAFGGPSA* 

KAKDFHGTYIGSLPFDLKINLELSLA 

PVRGALLPRKGIFRPVGFRLKG 

1003 

6500 

B 

1044 

203 

308 

XRPLFAPVRARAVEAAGPGSGRAA 
EHSGPTGTAGCA* 

1004 

6501 

A 

1045 

78 

308 


1005 

6502 

C 

1046 

35 

259 

MQFSTHRGQKYERTPDTSGARVIER 
PYLTVIIIHNNLEGRLKEKSGKPYKF 









AFYVILRCHFTDNFTTAALFVTR* 

1006 

6503 

A 

1047 

162 

1341 

PPFQLIMGEIKVSPDYNWFRGTVPL 

KKIIVDDDDSKIWSLYDAGPRSIRCP 

L1FLPPVKWNCKMSFSRQILALTWM 

GLTGLIAFAVIPVYCDHLEFCDGFR 

KLLDHLQL\DKVHLFGRFFGKGLWP 

MRFTGIPLQIS*SPIPLILCNSL\MDTS 

IFNQTWDCKQLLA*CPAFMLQKNSS 

LGMFHPGPGGPYGWAGCHLDFHG* 

DRA*ESFGVQELNLAFQGLTLELSK 

FLCGNLH*NSGTYL*TIYGMCLDPG 

GGAFQLKLKKKWYKLYPNGPKKL 

HLKTGGNFPYLCRSAEGNXLMVQIH 

LLQFHGTKYAAIDPSMV\RPRSLRC 

RKAALASARRSSSVSSSPVNDELTP 

VCSLYSQWAFSTRSDRPFPFQVPSG 

LTRGPHWGLGKVGLDGHLP 

1007 

6504 

A 

1048 

321 

888 

VELSVHPPIPADPRSLLAGAMPWKL 

PISLPAE\PPCSLSCCLLPPATQHPLH 

PALPAVDGAKKNPVFSGRLPPPP/PT 

QRTSASGISALYA*DREV*AQISELW 

AMRG * VQKVGGTVQISRAGQLAAV 

TSVGNMSVYPLALMTPPP\PSPPLPP 

PPPPVGRWSVGLRDLSPAVPSSSEV 

CLWRSVLCLIPGY 

1008 

6505 

C 

1049 

137 

320 

MLKSSFTCFPTEKGPKFLEDNLKTK 
XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXLV* 

1009 

6506 

A 

1050 

231 

477 

GVYFIFFIGLWFCFVLFDDGRFTVA 
QAGVQWCDLGSLQPP\PP\GSNDSP 
WPQLPE*LGITGACHHARLIFIFLLE 
TGFRPC 

1010 

6507 

C 

1052 

60 

523 

MAEAGFIHCPTENEPDLAQCFFCFK 

ELEGWEPDDDPMQRKPTIRRKNLR 

KLRRKCAVPSSSWLHGLRPLRXLPG 

PRVAAPLPGFIPWCHQPSCGPLSNV 

LGKEINIFKLDVSTVLLFCLESGTRG 

VLPVQRVLLVQLLALSLSPFFGGPF 

WRFE* 

1011 

6508 

A 

1053 

61 

208 

IFETGQRKSQEQNWSYSVTQA/GVQ 
WCDLGSPQPRPPGLK*FSHLSYMGS 

1012 


6509 

A 

1054 

198 

1011 

QTHGLQQPSQHLP/TSTLRTVTAST/ 

SMRSRH1HCGRLCHANKAVSSSKR 

DTAFFLPHFSPGKPGNQNSKNEPPK 

KRERERSSHCYPAAPAAQAEAPLVP 

LSRQNKSTVETSNLKMLISFPKTLLR 

GPQEGWWHQGINPGSGAATLGPGS 

SERPQSIEASCSMARRTFFAVSSNSF 

FLLLVSFAILFLALSLSSFKNSPRVNS 

SNCFLTERKAQPDECFLCSSMGSSS 

GSQPSSSLKQKKHWAKSGSFSVGQ 

WMKPASAIRSGVQRSPPRRASS 

1013 

6510 

A 

1055 

1077 

1457 

ARRPEPPHPAHRRGGGTPAQGGAG 

SPGASSDTSLRLEAPPQSIACWRSCC 

NAASWWTRSRGTCSRQTQR*GWP* 

CWRAQRGCAPALAAPQPFLAAHHG 

QRSAAASTTPGLHAGLRRSRRPPRP 

PRP 



1014 

6511 

A 

1056 

2583 

3580 

DRVSLLLPRLECNGA1LAHCNLCLS 

GSSDSPASASQVTGITGKCHHTQLIF 

VFLVEMGFHHIAQAGLELLTS\DSPT 

LASQSAG1TGVNHHAWLFFFCS/RD 

TVSLCYPGWSRVA*SRITATSA\PGL 

K*FACFSLPSSRDYRHVPPHPGNFCI 

FGRDEVSPCWPGWF*TPDLR\YPPA 

SASQSAEIIGVSHHTWPQEVFLFLNL 

FIYLRWSL/DSVAQARVQRRDLGSL 

QAPPPRFKPFSCLSLPSSWDYRRPPP 

HPANFFVFLVETGFYTVLARRVLIS* 

PRDLPASASQSAG1TGVSHHTR/LIF/ 

WPPPTnTHWTWA AVnWYTT/n^T O 
iNrriji u i no v i w/\/\ vy w i i i/vjojuv^ 

PRTPELK* SSHLILTSN WDYRCTPPC 

PPNLF1YLF1YFHRDEGSLCCPGWS* 

TPELK 

1015 

6512 

A 

1 AM 

1057 

loo 


WQnwnTD T4T QA AQFXKHAFWNJT P 

EELTLVKGQALPSRGLDCPGRPAPP 

AACPGPKWRACLWV*FPNQNPISPG 

RNFNLKT 

1016 

6513 

A 

1058 

10 

604 

ARPPKKPTSPTATTTTR*GARPATPP 

PRRPTAPIRWPCTPSTAPPRAC1*AT 

AP*QPQPAPHQQTAALFWASPPPSH 

QAPWIPPFLPQPPS/LPPPPPHSPRGP 

"on, a rinnn a p a p vpu/t? pwt a or *n 

WTPPG AGLELGGSPGL WG HRRARE 
GGAGEGRGFPEDRTGRFYKRIFVGR 
rvnwf pnpRn^FR^F<i<TKTFi pf 

1017 

/'CIA 

6514 

B 

1059 

lo/ 


lVl/\OVJOlN W JLOVJ V IN V VLV IVLrV I UOL V 

FVT T FTFVKROTMRFAMKSRRGPHV 
PVGHNAPKRSHFILK* 

1018 

6515 

A 

1060 

67 

458 


1019 

6516 

A 

1061 

164 

528 


1020 

6517 

A 

1062 

203 

364 


1021 

6518

A 

1063 

103 

1019 

GNGRGAPGDPCAVASAEPGLTSQD 

SGVNPNNSARGREAMASGSNWLSG 

VNVPLVMAYGSL/DVCTAIYFCEEA 

NHALCNEISKGTSCPCGTQCPQGT* 

K*EIDIRLSRVQDIKYEPQLL\ADDD 

A\RLLTTGKPRGNQSC\YNYLVIGM\ 

KALGLPFRTS\EIPFHS\EGRHSPFP* 

WGKNFRSYLL\DLRNTSTAFQGCTA 

KHLIDTLFGMAMETVARYGDKGVF 

U/pniV/ffcTVI RVflFAT QFI ATAVKART 

GELFSDIH/HVQAAKDLTQSPEVSPT 
TIQVTYLPSSQKSKRAKHFL\ELKSF 
K\DNYNTL\ESTL 

1022 

6519 

A 

1064 

1027 

1365 

PEVNRLYCLFKNK1*KALLSFQTYIC 
rv\/i xwn TPcv^v>f\/PV^y^POVWP\A^ ,, 

1 Y V LU V LilKtJSJVl V r IVlvU^Vc VV VLV 

IYMCVCVC11H*CVCVCIYIHTHTHT 
C/VCDW*AIQ**TCPHYFFLILDQCC 

1023 

6520 

A 

1065 

675 

819 

HRLDRAHP*RAEGNCLLPIVYLSY/G 
PLIA*TGQGTSSPCLCSL/*PRSAITHT 
PSQPGDPRQPQTVHSGELNPRVYTK 

1024 

6521 

A 

1066 

3 

603 

VDDFVQPARRRWEMLGVLPSVFLG 
LRRFVHPAKGMKQTRGDSFAFQSG









SPGVCFEGQEAEGSLSYGVGL1AAG 

FVLLSPPSQC/HDSLATQVLVCGVA 

LLWPSSGRAGTVQP*PAPENRSASP 

FCLPGHIQPVPVFTVPRSAITHTPSS 

NLGTPRQPQTVPLRGAESPGQPCPM 

SLRKLPQARPLVLYTCSCHPAVDEC 

FR 

1025 

6522 

B 

1067 

46 

1983 

MRPRKAFLLLLLLGLVQLLAVAGA 

EGPDEDSSNRENAIEDEEEEEEEDD 

DEEEDDLEVKEENGVLVLNDANFD 

NFVADKDTVLLEFYAPWCGHCKQF 

APEYEKIANILKDKDPPIPVAK.IDAT 

SASVLASRFDVSGYPTIKILKKGQA 

VDYEGSRTQEEIVAKVREVSQPDW 

TPPPEVTLVLTKENFDEVVNDADIIL 

VEFYAPWCGHCKKLAPEYEKAAKE 

LSKRSPP1PLAKVDATAETDLAKRF 

DVSGYPTLKIFRKGRPYDYNGPREK 

YGIVDYMIEQSGPPSKEILTLKQVQE 

FLKDGDDVIIIGVFKGESDPAYQQY 

QDAANNLREDYKFHHTFSTEIAKFL 

KVSQGQLVVMQPEKFQSKYEPRSH 

MMDVQGSTQDSAIKDFVLKYALPL 

VGHRKVSNDAKRYTRRPLVVVYYS 

VDFSFDYRAATQFWRSKVLEVAKD 

FPEYTFA1ADEEDYAGEVKDLGLSE 

SGEDVNAAILDESGKKFAMEPEEFD 

SDTLREFVTAFKKGKLKPVIKSQPV 

PKNNKGPVKVVVGKTFDSIVMDPK 

KDVLIEFYAPWCGHCKQLEPVYNS 

LAKKYKGQKGLVIAKMDATANDV 

PSDRYKVEGFPTIYFAPSGDKKNPV 

KFEGGDRDLEHLSKFIEEHATKLSR 

TKEEL* 

1026 

6523 

A 

1068 

1 

849 


1027 

6524 

A 

1069 

74 

2705 

KKLDFFRSLPVFQADFSHWQLFRVL 

FLLHPPLVISMDSWFILVLLGSGLIC 

VSANNATTVAPSVGITRLINSSTAEP 

VKEEAKTSNPTSSLTSLSVAPTFSPN 

ITLGPTYLTTVNSSDSDNGTTRTA\S 

TNSIGITISPNGTWLPDNQFTDARTE 

PWPGNSSTAATTPETFPPSGNSDSK 

DRRDETPIIAVMVALSSLLVIVFIIIV 

LYML/RGFKKYKQAGSHSNSF\RLS 

NGPH*GMWEPQSVPLL\ARSP\STNR 

\KYPTPGPWDKLEEEINRRMADDNK 

LFREEFDALPACPIQATCEAASKEE 

NKEK\NRYVNILPYDHSRVHLTPVE 

GVPDSDYINASFISGYQEKNKFIAA 

QGPKKETVNDFWRMIWEQNTATIV 

MVTTLKERKECKCAQYWPDQGCW 

TYGNIRVSVEDVTVLVDYTVRKFCI 

QQVGDMTNRKPQRLITQFHFTSWP 

DFGVPFTPIGMLKFLKKVKACNPQY 

AUA1 V Vrlv^bAO VuRTXb 1 FVVIDAM 

LDMMHTERKVDVYGFESRIRTQR* 

QMVQTDMQYVFIYQALLEHYLYG 

DTELEVTSLETHLQKMYNTIPRTNT 









NGL* EEFKKLTSIKIVQNDKMRTGNL 

PANMKKNRVLQII\P*EFTRVIIPVKR 

GEENTDYVNASFIDGYRQKDSYIAS 

QGPFS\HTMRDFWRMIWSGKSCSIV 

MLTELEERGQEKCAQYWPSDGLVS 

YGDITVELKKEEECESFRVRDLLVT 

NTRENKSRQIRQFHFHGWPEVGIPS 

DGKGMISIMAVQ\KQQQQSGNHPI\ 

TVH\CSPGGKERTGTFCA\LSNVL\E 

RVKA\EGILDVF\QTV\KSLR*\QRPQI 

GSRQLEQY*VLAYKVVPGVILDAIP 

QINAQLQSKAANKGPVDPGGLPFNI 

LVIFLFC 

1028 

6525 

A 

1075 

734 

1151 

YRRGPGGLRWAEMSGDFPPIPLPVR 

GIHPIPLRASQ/PVL*GGQQGMGTGP 

ISQ/PGETEFQTGLSACPKPHRVPGP 

SSCTTEKPSQRLHEQM VRGG* SSMG 

GAGNGVGMESGTVQGTPSGSGWR 

PAGTGVGARNCWYLPL 

1029 

6526 

A 

1076 

118 

399 


1030 

6527 

A 

1077 

1 

214 

LLMRVSLPSEVFFCVVFETESRFVT 

QAGVQWHDFG*LQPPPPRFK*FSCL 

SPPSSWDYRHVPPCLANFCIF 

1031 

6528 

A 

1078 

2 

152 

ETESLYVTQAGVNQWHDLGSLQPPP 
PRFK*FSCLSLPSLTTFDTSLKSME 

1032 

6529 

A 

1079 

2 

426 


1033 

6530 

A 

1080 

1 

1716 


1034 

6531 

A 

1081 

2 

886 

VGGRGEALDGGGSGAPPSVSQTES 

RAGTMSAYPKSYNPFDDDGED\EG 

ARPAPWRDARDLAPDGPDAPADRQ 

QYLRQEVLRRAAATAASTSRSLAL 

MYESEKVGVDSSEELARQRGWLEA 

HREDGGTRLDQDLKISQKHINSI*ER 

VLGGLVNA r TFKSKPVE\PPPE\QNG 

TLTSQPNNRLKEAISTSKEQEAKYQ 

ASHPNLRKLDDTDPGPPEAWASAP 

GVLMLTPKNPHLRA\YHQKIDS\NL 

DE\LSMGLG\RLKDIALGMQTEIEEQ 

DDILD\RLTTKVDKLDVNIKSTERK 

VRQL 

1035 

6532 

A 

1082 

1549 

1712 

SNL*FFFFEMESRSVA\RL\ECSGVIS 

A\HCNLCLPGSSNSPTVSAS*VAGITG 

A\THHSRLLFVFLG\ETGF\HHVG\QA 

GL\DSLTLMIQPALASPKCLGLQAVS 

PPMPSPYSSSFFCPLNILTPHVLYPG 

LNPPSSFCSDL 

1036 

6533 

A 

1083 

2 

336 


1037 

6534 

A 

1084 

218 

1080 

PSSRMNH\LPEDMENALTGSQSSIAS 

LRNIHSINPTQLMARMESYEGREKK 

GIS\DVREDFCLFVTF\DLLF\VTLLW 

D**EFKC*MGGHLRNTFRRRR*LQY 

* LTISSIFLILFLPGQFFRF*KCLILAY 

AV\CR\LRHWWGQ*RLTTAMTSALL 

LAKVILLKLSSQGAFG\YVLPFI\SFIL 

AW\IETWFL\DFKVLPQE\AEEENR/L 

SLIVQDASERAALI\PGWSFWMGQF 

YSPPGIRRQDLEGSLKEKQGQLRKP 









LFRNYEYYFLFKCGKTLTESHRGKK 
RQAVESPCRQ 

1038 

6535 

A 

1085 

3 

400 


1039 

6536 

B 

1086 

333 

436 

XTPVFSYGDEIGLDAAALPGQPHEG 
LLLRFPYAA* 

1040 

6537 

A 

1087 

127 

2041 

RGVWGGHVPGGSREEWSGEDQGG 

KRRGDAVAENCAEAREAGESVLGP 

RVQVGVEPPSDRKLRRVAGSAGTM 

SQDTEVDMKEVELNELEPEKQPMN 

AASGAAMSLAGAEKNGLVKEIFPQ 

KKKGGGSAEDEAEAAAAAKFTG\L 

SKEELLKVAGSPGWVRTRWALLLL 

FWLG\WL\GMLA\GAWIIVRAPR\C 

RKL\PAQKWWHTG\AL\YRIGEPLRP 

FOGQRRGANLAGSLKGRLDY\LSSL 

KVKGLVLGPIHKNQKDDVAQTDLL 

QI\DPNFGSKEDFRSLL\HSVKTKNL 

RGILD\LTPNYRG*ELRWFSTQVDT\ 

VATKVKDALEFWLQAGVDGFQVR 

DIENLKDASSILGLSWQN/ISPKGFSE 

\DRLLIAGTNSSDLQQILSLLESNKD 

UXTSSYLSDSGSTGEHTKSLVTQY 

LNATGN\RWCSWSLSQAR\LLTSF\L 

PAQL\LRLYQ\LMLFTL\PGTPVFQAT 

GNEMGLGCSCPLLGQPMGGSQFML 

W\DEVPAFP\DIPGGC*VANMDCGR 

GQSE\DPGSL\LSLF\RRLN\DQRSKE 

APPYCHG\DF\HAFLPLGPWTLSPIIR 

QLGTQNETFSG*CLNLGDVGLS\AG\ 

LQASDLP\ASAKPWPAKADLLLSTQ 

PG\REEG\SPLELERLKLEPHE\GL\LL 

RFPYAA 

1041 

6538 

A 

1088 

652 

905 

HLLPPLTPTTTQWGRDLLLSPISQM 
RKLRHRKVKKKTRT/WPGVVDHPL\ 
NLSTLGGGAWRIA*GQEFETSLGNI 
ARPCLYKKKFK 

1042 

6539 

A 

1089 

3 

591 


1043 

6540 

A 

1090 

266 

1905 

LGGHTWGTAAAGVWSDWPGRSW 

AELTSENSAGLSPSWGSPQDEVPGA 

WPMLQGAVEPMQIDVDPQEDPQN 

APDVNYWENPSLDLEEFAASYSGL 

MRIERLQFIADHCPTLRVEXALKMA 

LSFVQRTFNVDMYEEIH\RKLSEAT 

RSSLRELQNAP\DAIPESGVEPPAL\D 

TAWV*\VTRKKALLKL\EKLDTDLK 

NYKGNSIKESIRRGHDDLG\DHYL\D 

CGDLSNALK\CYF\RPRDYWTSAKH 

VINMCLNWIKGQRFTLQNWSHVLS 

YVSKAESTPEIAEQRGERDSQTQAIL 

TKLKCAAGLAELAARKYKQAAKC 

LLLASFDHCDFPELLSPSNV\AIYGG 

FCALVTF\D\RQELQRNVI\SSSSFQV 

VLGSWEPQV\RDIIFKFYESKYAS\C 

LRRLDEMKDNLLLDMYLAPHVRTL 

YTQICN\RALIQYFSPYVSPDMHRM 

AAAFNTTGGPPWKNELIKFILVEGLI 

SARVD\SHSKIL\YARDV\DQRRTTF* 

ESLCLMG\KEFQRRAKA\MMLRAA 









V\LRNQIHVS YPTQRKDARG ELTPA 
NSQSRMSTNM 

1044 

6541 

A 

1094 

2 

397 

SQDHVRGFLEKESAIVSRPLNPFTA 

KALSGTSPGDSDDVQPGPSVGPPSK 

DKDKVLPSFWIPSLTPEAKATKLEK 

PS/RLECVEKLIRKDMVDPVTGDKL 

TDRDIIVLQRGGTGFAGSGVKLQAE 

KSRPVMQA 

1045 

6542 

B 

1095 

30 

310 

MTRHGKNCTAGAVYTYHEKKKDT 
AASGYGTQNIRLSRDAVKDFDCCSL 
PRSCCHAYEKQRGTRREEQKELQR 
AASQDHVRGFLEKESAI* 

1046 

6543 

A 

1096 

29 

449 


1047 

6544 

A 

1097 

2 

1069 

IETRCTPRCRNSARGESLVRMTRHG 

R\N\CTAGAVYTYHEKKKDTAASGY 

GTQNIRLSRDAVKDFNDCCCLSLQP 

CHDPVVTPDGYLYEREAILE\YIL\H 

QKK\EIARQMKGLTRKAGGGTRPPK 

EQKELQR\AASQ\DHVRGFL\EKESA 

IVSRPLNPFTAK\ALSG\TSPGDSDDV 

\QPGPSVGPPSKDKDKVLPSFWIPFA 

TPRAKATK\LE\KPSRTVTCPHVKGS 

PLR\MSEPERPVHFHNR*NSS\VEPR 

GASITPQASAYVCA\VTR\DSLSKRQ 

PPVAVLRPSGAVVTLECVEVKLIRKD 

M\ADPVTG\DKLTDRDIIVLASGAVT 

GFAGSGVK\LQAEKSRP\VMQALRC 

AGGPNKPGLGP 

1048 

6545 

A 

1098 

5 

576 

SRVVEFAKMAENSGRAGKIIRDDSG 

VKGAVSHEQVIAGLQTTFGRNQRG 

LASQS/AAELGDWKLNEATALVIDT 

TGRE V\DET\RKC YRMV WKEFLVEA 

NLSKEVACPAFGRTTKEQ\IQKII*DT 

*HSSFQAKGKELK*ISGKKHNI/RVL 

MGEDEKPSQPKENS\EGGLGLKAS\S 

AGWWSPRDQGLCIFFLP 

1049 

6546 

A 

1099 

534 

1004 

RMSAGALFIWGTAI\YFDRKKTEVT 

PNFQEPGFRERRKKQKLAQGEKLG 

FPK\LPD\LKDAEAVQKFFL*RNTSL 

GEEL\LAQ\GEYEKG\VD\HLTKPELP 

VCGQPTASLLQVL/QQANFFPPPV\F\ 

QMLLD* SSPTISQRIV\SAQSLAE\DD 

VGMRNKCLH 

1050 

6547 

A 

1100 

91 

942 

GLLVGVGAAAVMPGIVELPTLEEL 

KVDEVKISSAVLKAA\AHHYGA\QC 

DKP\NKE\FMLCRWEEKDP\RRCLE\ 

EGKLVNK\CALE>FFKA* *NRHCAE/P 

LFLQEYWGLCIDYTRPSKLFPSTVR 

KQAGKSFDEC\VLDKL\GWVRP*PG 

ENCQKVT\KVKTDRPLPENPYH\SRP 

RPDPSP\EIRGEILQP\ATHGS\RFYFW 

TKLKMGPVAHTRSCAQTTTDENAH 

AGLHPTDSGVLSGIHQHLTKKLTYD 

DLAVILYHFLSIKIFKGEAPPLQHYP 

QSHQTTLCSPQNPG 

1051 

6548 

A 

1101 

140 

812 

GDFGDRAGAGRETEEIFHSSQQLKI 

RPWAGAGRAAEPKDWR1WGTGEW 

GSERQIPSPPVPSRSNPHFLPQAGAG 









HFVQNPCKYPTPSPLQRSLEELPLSP 

TVFKLLHPDPPPCKAPLTLQPVAPK 

* I * ESPPLPPMLDPRDFPSPAPHPANE 

LKKRRGNPRQWFFLSEPRLSHLQN 

GTGWGWKGKDGRGRRTGMELGLI 

PEAPVPISGPPFIHSHSQPPCYTGL 

1052 

6549 

A 

1102 

228 

860 

STAQGNLLTVFIQPRASMSGGKYW 

DSEGHLYTVPIREQGVNIYKPNNKA 

M\ADELSE\KQVYDAHTKEIDLVNP 

RPLNIFNG*PWSKIDFKDVIAEPEGT 

HRF*RAFGKASFHHLSL*RKYWFLP 

LCWSALFGHPRWALIWGHFTSANS 

LLFLAHL/WAWP\CIK\SFLI*GFQCI 

SR/VSYSILRSTTGLVTPLFEAVG\KI 

FQQLSASNLQKEI 

1053 

6550 

A 

1103 

825 

920 


1054 

6551 

A 

1104 

222 

1244 

RWEKKMALLCYNRGCGQRFDPET 

NSDDACTYHPGVPVFHDALKGWSC 

CKRRTTDFSDFLSIGGCTKGRHNSE 

KA\PEPVKPEVKTT\GTKELCELKPK 

FQEHIISSPLSQ*KQLKRPSPDEPMT 

NLGIKNIWPPLKQALDKLKLSSGNE 

ENKKEEDNDEIKIGTSCKNGGCSKT 

YQGLESLEEVCVYHSGVPIFHEGMK 

YWSCCRRKTSDFNTFL\AQEGCTKG 

KHMWTKKDAGKKWPCRHDLHQT 

GGEVTISVYAKNSLPELSRVEANST 

LLNVHIVFEGEKEFDQNVKJLWGVI 

DVKRSYVTMTATKIEITMRKAEPM 

QWASLELPAAKKQEKQKDDTTD 

1055 

6552 

A 

1105 

87 

313 

ISQERG *RRDKERLAQREIK/RRRER 
EK/ER* EERIDKKREAKREKR/ERER 
KIPEEREEERKKGIFVFIWFNPMSVP 
H 

1056 

6553 

A 

1106 

37 

404 

PQLSRCRSECMYVNPTWMTSMGQ 

ATWSDPHKAKTMLNRIPLGKFAGE 

SGGSPASVVPAVPVCALGRGGRER 

WAAASFLYAPDPRPAH\EVEHVVN 

AILFLLSDRSGMTTGSTLPVEGGFW 

AC 

1057 

6554 

A 

1107 

19 

919 

AVWWNSELFLAGRRVLVTGAGK/G 

WAAGKGGQRPAAGRGGQGTPSLSP 

LPAGIGRGTVQALHATGARVVAWS 

RTQADLDSLVRECPGIEPVCVDLGD 

WEATERALGSVGPNVDLLVNNAAV 

ALL\QPFL\EVTKEAF\DRVCPSASRS 

FE\VNLRAV1QVSQIVARGF/I*ARGV 

PGAIR/VNVSSQCSQRAVXTNHSVYL 

LPTKGVPLDMLDQG*WAL\ELGPH 

KLSRCRSELNASKP/TTVGD*RSMG 

PGPPWSDPHK\AKIMLNRIP\LGKFA 

EVEHVVNA\ILFLLSDRSGMTTGS\T 

LPVEGGFWAC 

1058 

6555 

B 

1111 

28 

384 

MKAAVLTLAVLFLTGSQARHFWQ 

QDEPPQSPWDRVKDLATVYVDVLK 

DSGKDSVTSTFSKLREQLGPVTQEF 

WDNLEKETEGLRQEMSKDLEEVKA 

KVQPYLDDFQKKWQEEMELYRQK 



1059 

6556 

A 

1112 

44 

1009 

GGIREGGPPRPFRMKAAVLTLAVLF 

LTGSQA\RHFWQGG*SPPRAAWDR 

V\K\DLATRVPWTVLKEQRTETYVS 

QFEGLRLGENS*TLKLL\DNWGQR* 

PSTFQPSCAKQLGP\LTQEFWYNLE 

KETEGFRQEMSKDL\EEVKAKVQP/ 

YTLDDFQERSWQEE\MELYRQKV\E 

PLARKNFQEGVARPESLHELARRSLS 

PLGEAVSRPRARPMWDALRT\HLAP 

YSDEMMPALGRAPLGALRENGGAR 

MGQYHAXQATEHLSTLSEKAKPALE 

DXLRQGLLPVLESFKVSFLSALEEYT 

KKLNTQLRRPPPPPYPVLRINVSKV 

EKKKKK 

1060 

6557 

A 

1113 

62 

393 

IPAKQPTPTSLKTPTEECDQHENTAS 
SPSPMTPPCT/PSTNQPSPKLPVSHSP 
NP*KPPAPKLLREMDLTFPPHFPPSV 
APTMKPLSSATTPMPRRISLSGSHSR 
RWDPFVG 

1061 

6558 

A 

1114 

3 

450 

QTQREPTMVLSPADKTNVKAAWG 

KVGAHAGEYGAEALER\MFLSFPTT 

KTYFPHFDLSHGSAQVKGHGKKVA 

DALTNAVAHVDDMPNALSALSDLH 

AHKLRVDPVNFKLLSHCLLVTLAA 

HLPAEFTPAVHASLDKFLASVSTVL 

TSKYR 

1062 

6559 

A 

1115 

9 

675 

NSARATDSERTHHGARLLPDKTNV 

KA\AWGKVGAHAGEYGAEALERM 

FLSFPT\TKTYFPHFDL\SHG\SAQG* 

RAHGK\KVA\DALTVNAVA\HVD\DM 

PQTALSALSGPATAHKL\RVDPVQL 

SSS*SHLPCWWTLGRPTSPSEFNPW 

RLHAFPGTKFPGLLVEAPLLEPSKLP 

LKLGSLRVGHAFFAPLGLPPRALLP 

FPGTRNPVGLLNKILNWGGKKKKK

KKIF 

1063

6560

t> 

1116

61 

348 

ESALTQLLKAGGSLKKFLFHPGDTV 
PSTARIGYEQLALGVIAAGAGAIVH 
EKHPGKLAGYISSLLTLAGFATAMA 
AVVLCVNSFIWQTEPFLYIDT* 


1064

6561

A

1117

2

256 

CLSCAFWAGSWIAAGAGAIVHEK 
HPGKLAGYISSLLTLTGFATAMAAV 
VLCVNSFIWQTEPFVLYIDTVCDRSD 
PVFPTTAIVL 


1065

6562

A

1118

3 

270 

AVVLCVNSFIWQT/EPFLYIDTVCDR 
S/DALFLAVGVLKVIVSLVSLGVGL 
RNLCGQSSQPLNEEGSEKRLLGENS 
VPPSPSREQTSTAIVL 

1066 

6563 

A 

1119 

1 

642 


1067 

6564 

A 

1120 

46 

998 

AIVPSWDLDKDTISLLSPVLCIFPSPS 
SQTSLLYVFSLAGRMTQNTVIVNGV 
AMASRPTQPTHVNVHIHQESALTQL 

I V AM/^OT LEVITT T7i_iD/^r\'T , \/r>cnr A DTP 
Ijlv/\vj\vJoLKls.r LrrlrOD I Vro 1 AKI(j 

YEQLAL\GVTQIFAGALRGVIL\GVC 

*SWGPGTVLRASGCAFWAGSWIA

AGAGAIVHEKHPGKLAGWISSLLTL 









AXGFATAMAAVVLCVNSFIWQTEPF 

LYIDTVCDRSDPVFPTTGYRWDAA 

KSRRNQWQKEECRAYMXQMLRKLF 

\TAIRALFLAVCVLKVIVSLVSLGVG 

LRNLCGQSSQPLNE/EKGSEKRLLG

ENS\VPPS\PSREQTSTAIVL 

1068 

6565 

A 

1121 

504 

1026 

KIKPvKKKRPHIPVLREIPGLLNIPCL 
WHLEWVTLSKSPSKRFCAFAVGGN 
GGESGWLGGTRP/PSPRGMHLPGSS 
SSESEPHRNCPCPGSAQPCGHQAGS 
EDPQNTGPVAS/EL*PPACWRLCGQ 
PGPIAGAPAAPLAGHPRPPWRQVGP
GTSGSSQSWVSSCDHGGQHSGQHQ 
SWQ 

1069 

6566 

A 

1122 

461 

548 

KNLEQKNAMIHSAGEHHQGAERRS 

TWEELEGPRVTSLTLLV*RAWSSGP 

APSPT*PPSCTPPRRSS*APAGPSDAS 

PSRRPRA/SPASRQAAPKDKLPETPR 

RRIEKEPXPGPFAPGIFE/GGFTVSGG 

REQETPFAGTSGCY/RPTPPHFCWL 

GSPPRSTSSKAGGPSSSPPSPPTEAES 

STARPAKSRTMPTSGWHIGSTRPPR 

RRPSPR*RTSCTAHRRTSCSFSGTMP 

ARSRGLHSEIRRLQQHCTDLTYELT 

VKCSEQTGDGTSKSSELKKRCEELE 

AQLKVKENENAELLKELGAEKRDD 

SQCWRTPSRSREKKYLGGA 

1070 

6567 

A 

1123 

148 

197 

DPLGFL*QKRNQQEDD 

1071 

6568 

A 

1124 

1333 

2383 

RMKKEHVLHCQFSAWYPFFRGVTI 

KSVILPLPQNVKDYLLDDGTLWSG

RDDPPTHSQPDSDDEAEEIQWSDDE 

NT\ATLT\APEFP*SLPLKVQGSYQIP 

LGGQVSFPKV*FGS\APRD\AYWIA 

MNSSLKCKTLSDIFLLFKSSDFITRD 

FTQPFIHCTDDSPDP\CIEYELVLEK 

WCEMIPGGNEFRCFVKENKLIGISQR 

DYTQYYDHISKQK\EEIRRCIQDFFK\ 

KHIPVQIL*MKDLVFDIYR\DSRG\K

VWLIDRNPFGEVTDSLLFTWEELI\S 

ENNFKRRFLVKVDAQEQDSPSFSVA 

QTSEVTVQPQPLICSYRLAPKDFVDL 

SVTGEDAHKLIDFLKLKRNQPEGR 

1072 

6569 

A 

1125 

162 

413 

GADGQINQQTLDDRSGDECLDECP 
GPPRRGKGPQREVQPASPQASPPGT 
HQ*GSDGSSCTPSPV\SIGSPGLGPPI 
WRPHPKPG 

1073 

6570 

A 

1126 

2

228 

GGPRNKEPYPQGPKNRAQSPKNLV 

HSLTSM*SSP/LPFKPSKSTIIDNCPLT 

HQ*NPLKP*PLFPPSPNIPPGFKKP 

1074 

6571 

A 

1127 

302 

488 

SPICLTPKSSLTHSSDDYKYSVWQR 
AVAHTCNPSTLGAEAG/WVT*AEEF 
KTSLHNIVRPCLY 

1075 

6572 

A 

1128 

66 

703 

RRRRLPSVAIMIILPGPSSSHDEMRS 

DIYKMRGDRGRGLCLEGGRGRWVS

RTE\GTID\DSLIGG\NA\SAESPRGAK 

GTERHK*STGV\DIVMNHHLPGNKF

SQKEASKKVHQRITMKSI\KGKP*K 

NRRPRKSKTFL*QGAAEQIKHILAN 









FKKLQFA r FIGENMNPRWAWVLLL 
DY\*RDGVPP\YMIFFK\DG\LEMEK 
MLTNVAIILDLSPCHP

1076 

6573 

A 

1129 

1 

1006 

LLLLGHASTSTGGGGKRASKMATT 

KRVLYVGGLAEEVDDKVLHAAFIP\ 

WGDITDI\QIPLDYETGEK\HRGFAF

VEFELAEDAAAAIDNMNESELFGRT 

IRVNLAKPMRUKEGSLARPACSDND 

WLKKF\SGKTL*ED*RGRRVRASQS 

RDPG/TGRAHLLKRRRSNPQVYWVD 

IKIGNKPAGNRIQMLLRSDVVPMTAE 

NFRCLCTHEKGFGFKGSSFHRIIPQF 

MCQG\GDFTNHNG/TLGGKSIYGEE 

VSMMbNFILKAYGDQGLLS/MLAC 

GDPNTNGSQF\FLTC*\KTDWLGWA 

SHVVFGEVHRKALGCLCGQIEAQG 

SKDGKPKQKVIIAGLWGSTC 

1077 

6574 

A 

1130 

1 

574 

DTRFLERLRLSISFLVQTPIGHSTEED 

QGLLSTSLWGKWKCGKNAGRKKP 

LGKAPLVVLPPWDPKRFL*KSFGQT 

LSLCPLPHPWGKPPKSQRHHGKERC 

LTrPWGDAHKArLDDrQKArFAQA 

*VNLH\CDKPAMWDPENFQAPGEM 

LLVTVLAAIPFSGKEFHPWRLQGFLG 

RKMGDLELASALVPSRYH 

1078 

6575 

A 

1131 

200 

740 

HGSMRRLLIPLALWLG\AVGVGVSQ 

I*ENPSPGGLQVALEEFHKHP\PVQW 

AFQH/TAVLESAVDTPFP\AGIFVRL 

E\FKLQQ\TSCPEEGTWKKPRVQKSR 

PQWDGNRKLPWPCIQTWALEDK.SS 

WARLVPPPPIKTQVLAGGWRSTQE 

DPSCLRVQRAC*RTPPSFYFPGQFAF 

SKXALPRS 

1079 

6576 

A 

1132 

79 

933 

EWPSSIDLVNELQVGISEKVSFLNR 

KIKPQVPLWYRLDGKVIILTAAAQG

IGQAAALAFAREGAKVIATDINESK

LQELEKYPGYSK/PRVLDVTKKKQI 

*SSLPMKLRDFDVLFNVAGFCPS\RE 

LVLGL*GRKDWGLLR*ISMWRSTY 

LMDSRAFLPKML/RFRNLGNIINMS 

CCGLPSVKGWEQDVCTAQPKASR 

OLAa 1 K.o VOCKrHaKt^unouANlA^ 

VARGTVDTPISYKKEYNATRKS*TT 

ARE*FP*RDKKPGKIPQLPEEISHAL 

KKJ oA oUMbo A I V 1 W 

1080 

6577 

A 

1133 

1601 

1778 

MEQIRASGKLYKSLYQKREIYIYMC

STTYIYT*Y/HSTAYIYICLYVHIYIY 

MYRQYYVF 

1081 

6578 

C 

1134 

58 

394 

MAEKPKLHYSNIRGRMESIRWLLA 

AAOVbrbxiKi'ilioAblJLUl^KNlJvjY 

LMXQQVPMVEIDGMKLVQTRAILN 

YIASKYNLYGKXLKGESPFNLREQD 

AKXCLDPRGNPKIX* 

1082 

6579 

A 

1135 

133 

985 

RNLRGIAILAGKPQVQFFHSRG\RM 

EDLDKFR\NDGYFDVSSKCPMV*DL 

MGLKLWQTRAILNLHLPANYNL/H 

YGGKDIKGREPLI*YCILGRY*PDFG 









VEIV/LLLSCPYCSTLREQDCQALPW 

IKEKN*KNRPTFPCPLEKVLKRAHG 

TKTY\LVGKKS*ARAGPFH\LVELSS 

TTVEELDFQSLSPSFPLPERPLETPES 

RQPCPHSERKFPYSPGQPQGKPSPW 

DEE/SL*EGRQGKDFSGFLINGSPWE 

GPRNLAINPMVS 

1083 

6580 

B 

1140 

866 

942 

MDRHCPKLSGAPFGPPAPILGLTDP 
EFSHEPKLHHARILHRAPPPTRDHP 
VGVISRLPRAGRGRAEGSPPGPDLF* 

1084 

6581 

A 

1141 

405 

536 

KSAPRPGVVAHTCNPSTLAGRG/G 
WIT*GQEFENSLANIVKCCFY 



1085

6582

A

1142 

49 

365 

TPDKPIRSHETLPIHEK*PRGKTGPPP 
DSDDPPPGSPSPQHVGNSQKINSKLP 
ISSGDHSPNPYHW*CPLPSVLGIP/PV 
RRDPLCGPESPQEEGGQQRNESFDIF 

1086 

6583 

A 

1143 

3 

452 


1087 

6584 

A 

1144 

9 

486 

NSARATDSERTHHGACLLPDKTNV 

KAVAWGKVGAHAGEYGAEALERM 

FLSFPTTK\TYFPHFDL\SHG\SAQG* 

RAHGK\KVA\DALTKRRGATWDDM 

/PQTALSALSDLH\AHKL\RVGPGSTF 

KLL\SQLPCLGEPWAAHLPA\EFQPL 

AVARLPWNKVSWGFC 

1088 

6585 

A 

1145 

1890 

2027 

KCLCPPR/RCPQPLTPYPC*GVKCPP 
SEIKYKP*MCPIGCPKPSIQC 

1089 

6586 

A 

1146 

1 

903 


1090 

6587 

A 

1147 

1 

1131 


1091 

6588 

A 

1148 

1 

1376 


WALPAGFDGVMSHRKFSAPRHGSL 

GFLPRKRSSRHRGKVKSFPKDDPSK 

PVHLTAFLGYTPCIAAHIVREVDRPG 

SKVNKKEVVEAVTIVETPPMVVVGI

VGYVETPRGLRTFKTVFAEHISDEC 

KRRFYKNWHKSKKKSFTKYCKKW 

QDEDGKKQLEKDFSSMKKYCQVIR 

VIAHTQMRLLPLRQKK\AHLM\EIQ 

V\N\GGTCARESWDWPPREGLKQQ 

VPV\NQVF\GQDENDRTSSGVTQGP 

KGLQRGSPSRXWHTQESCPRKDPPE 

GLRKWACIGAMAIPARVALPLWQR 

AGQEKATHH\RTEINKKIL*DLAQG 

Y\LIKGGI07VSKNNASH*PMTLSDK 

SIVNPLGGFVH\YGK*TNDFVTFKSL 

VWLGPKKRVLT\LRK\SLLVOTKAA 

GLWRRITLKFMDTTSKFGHGRFQTM 

EEKKAFMGPLKKDRIAKEERSLMP 

GTDFASWWGLNKSYFPLKKKKK 

1092 

6589 

A 

1149 

3 

497 

PTLLVPTDSERTHPWLLSPADK\TN 

VKA\AWG\KVGAHAGEYGAEALER 

MFLSFPTTKTYFPHF\DLSHG\SAQV 

*GPRARKVADALYTNAVAORGTDIA

QRACPPLSDLH\AHKL\RVGPGSTFK 

LLKATC/HCLGEPWAAHLP\AEFQPL 

AVARLPWGQSFLGFLLKQRC 

1093 

6590 

A 

1150 

26 

508 

NSTDSERTHPWLLSPADKDQRQGP 
AWGVKVGAHAVRSMCAEALERMF 
LSFPTTKTYFPHFDLSHGF\SQV*GP 









RARKVADAL\TNAVAQRGTDIAQR 
AVPPLSDLHAHKL\RVGPGSTSKLL 
KPLACLVDPGPPTSPAEFHPLRLQG 
FPGDKFPGLSVGKFKI 

1094 

6591 

A 

1151 

3 

520 

AKKHGHRGSSLTVFGGPHRLRENPP 

WCSSPADKTNV\KAAWGKVGAHA 

VRSMCAEALERNMFLSFPTTKTYFP 

HFDLSHGFCPG*RATAKKVA\DALT 

KRRGATWDDMPQTALSGPEATLH\ 

AHKAFGVEPGSTLKLL\SHLPCW*P 

LARPPSPAEFDPWRVERLPWDKFP 

WVSC 

1095 

6592 

A 

1152 

232 

783 

TCNPQTSSNGSQNSKLGRISRKWR 

MRRCFCFSITRWCLECTSPRSRGRW

KTDDVLLKLENMGVSELGQRI*LER 

FTKDTARFKDELDIMKFICKDFWTT 

VFKKQIDNLRTNHQGIYVL\QGNKF 

RLLTQMSAGKQYLEHAS*V/Y*HLP 

CGLIRGGLSNLGUKSIVTAEVSSMP 

ACKFQVMIQKL 

1096 

6593 

B 

1153 

28 

249 

MFLSFPTTKTYFPHFDLSHGSARLR 
ATARRWRRADQRRGHVDDMPNAL 
SALSDLHAHKLRVDPVNFKLLSHCL 
L* 

1097 

6594 

A 

1154 

3 

511 

DKTNVKAAWGKVGAHPGEVWCGP 

LERMFLSFPTTKTYFPHFDLSHGSA 

QVKGHGKNVADALTNAV/AHVDD 

MPNALSALSDLHAHKLRVDPVNFK 

LLSHCLLVTLAAHLPAEFTP\AVARP 

SLGQVSWAFL*SNRCWTFQISLPAE

FTPAVHASLDKFLASVSTVLTSKYG 

1098 

6595 

A 

1155 

2 

247 

PADKTNVKAAWGKVGAH/AGEYG 
AE/ALERMFLSFPTTKTYFPHFDLSH 
GSAQVKGHGKKVADALTQGELGG 
EVGGQGHQQAVA 

1099 

6596 

B 

1156 

74 

195 

MFLSFPTTKTYFPHFDLSHGSAQVK 
GHGKKVADALTNAVXT* 

1100 

6597 

A 

1157 

3 

224 


1101 

6598 

A 

1158 

3 

136 


1102 

6599 

A 

1159 

1 

371 

TQREPTMVLSPADKTNVKAA/WGM 

FLSFPTTKTYFPHFDLSHGSAQVKG 

HGKKVADALTNAVASVDDMPNAL 

SALSDLHAHKLRVDPVNFKLLSHCL 

LVTL\AAHLPAEFTLAVHAFLGQFP 

GFF 

1103 

6600 

C 

1160 

1 


156 

MVRRPWRGCSCPSPPGXPPPRRVHP 
CGAXLPGQVSXFCEQRAELQIXLRL 
EL* 

1104 

6601 

A 

1161 

1 

577 

AAWGKVGAHAGEYGAEALERMFL 

SFPTTKTYFPHFDLSHGSAQVKGHG 

KKVADALTNAVAHVDDMPNALSA 

LSDLHAHKLRVDPVNFKPPRPTSRT 

ST*ATALPRLRATARRWPTR*PTPW 

PRGRHAQRAVRPERPARAQASGGP 

GQLQ\LLSHCLLVTLAAHLPAEFTP 

AVHASLDKFLASVSTVLTSKYR 

1105 

6602 

A 

1162 

1 

680 

ERTTMVLSPADKTNVKAAWGKVG 









AHAGEYGAEALERMFLSFPTTKTYF 

PHFDLSHGSAQVKGH/GRMFLSFPT 

TKTYFPHFDLSHGSAQVKGHGKKV 

ADAT TMA VAHX/TYnK/TPXIAI QAI QT"M 

HAHKLRVDPVNFKLLSHCLLVTLA 
AHLPAEFTPAVHASLDKFLASVSTV 
LTSKYR 

1106 

6603 

A 

1163 

2 

1758 

TMVLSP/ADKT/NVKAAWGKVGAH 
AGEYGAEALERMFLSFPTTKTYFPH 
FDLSHGSAQVKGHGKKVADALTN 
AVAHVDDMPNALSALSDLHAHKL/ 

RSRTST*ATALPQVKGYGQEGGRR 

vnop dp.ad nv u a r\v> a a/d dcdd at d 
V UyW\UAKuKnAyi\A V JKJrJDtvrALK 

NKLWVDPVNFKLLSHCLLVTLAAH 
LPAEFTPAVHASLDKFLASVSTVLT 
SKYR 

1107 

6604 

A 

1164 

121 

521 

SFNKFFKKAKAVSQKKIPATKLRDK 
GLQTKYSCLYYYFYLRHGLALSPRL 

VLKLQYFCTS**LGITGACHHASLLL 

K'FFVPTni TVT PPT VQWPWPQCPDCT 

LKCWD 

1108 

6605 

A 

1165 

407 

548 

VPFTKWHQKIEAGQAWWLMPVIPA 
VWEAE\AGGSLEIRSS*PAWATW 

1109 

6606 

A 

1166 

25 

628 

EFHRLRENPPWCLSPADKTNVK/AP 
AWGKVGAHAGEYG\SEALER\MVL 
FPPPTPKPYFPHF\DLSHG\SAQV*GP 
RARKVADAL\TNAVAQRGTDIAQR 
AVPPLSDLH\AHKL\RVGPGSTFKLL 
JSJ\ X wJlUJLAj.br WAAriLrXAJbr vJFLA 
VATSSLGTKFPGFLVEAPLLTFQITF 

IffiWl WT AIVPI PPni PPQPQCPPI 14 

PYPRGL 

1110 

6607 

A 

1167 

2 

121 

TFVRLGTLSTPLWGSYDFFFPS*FSL 
FLFYSVHFMPLAF 

1111 

6608 

A 

1168 

3 

582 

AKRELRFLLVYLHGDDHQDSDEFC 
R\SQALRENTYPFLAMIMLKDRRMT 
WGRLEGLIQPDDLINQLTFIMDAN 
QTYLVSERLEREERNQTQVLRQQQ 

KRRKEEEVQQQKLAEERRRQNLQE 

FKFRKT FPT PPFP^PnnPF^VKTTFlfT 

PNDSRVERRFHFSQSLTVRTT 

1112 

6609 

A 

1169 

30 

130 

QILLSPCLPPP*YLNKRWPEDNTCLL 
KTKLKRT 

1113 

6610 

A 

1170 

2 


1412 

GIAGPTISCRGGGKMAAPEERDLTQ 

EQTEKLLQFQDLTGIESMDQCRHTL 

EQHNWNIEAAVQDRLNEQEGVPSV 

FNPPPSRPLQVNTADHRIYSYWSR 

PQPRGLLGWGYYLIMLPFNRFTYYTI 

LDIF\RFAPRFIR\PEP\RSR\VT*PRLG 

TIVSIYGTPFEEKYGRAHPVFYQGT 

YSQALNDAKRELRFLLVYLHGDDH 

QDSDEFCRNTLCAPEVIS\LINTRML 

FWACSTNKPFGYRV^OAI RFMTVP 

FL\AMIMLKDRRE*PV\VGRLEGLI\Q 
PDDLINQLTFIMDANQTYLGV/SERL 
EREERNQTQVLRQQQDEAYLASLR 




ADQEKERKKREERERKRRKEEEVQ\ 
QQKLAEERRRQ\NLQ\EEKERKLEC 
LPP\EPSPDDP*KVSKIIF\KLPNDSRV 
ERRIPLFHRSLTVIHDFLFSLKESPEK 
FQIEANFPRRVLPCIPSEFAWPNPPTL 
QEVAGLSHTEVLFVQDLTDE 


1114 


6611 


1171 


232 


427 


STKISHTKKTARJNKTKGGRGQAW 
WLSPIIPTLWEIKVGGLFEPRRSRLQ 
*AVFMP\CTPSWATQ 


1115 


6612 


1172 


232 


427 


STSISHTKKTARINKTKGGRGQAW
WLSPIIPTLWEIKVGGLFEPRRSRLQ
*AVFMPACTPSWATQ


1116 


6613 


1173 


77 


1775 


GRKVVMDLIPNLAVETWLLLAVSL\ 

VLLYLYGTRTHGLFKKLGIAGGTPL 

PFLVNALYFRKGYWTFDMECYKK 

YRKVWGIYDC*QPMLAITDPDMI\K 

TVLVKECYSVFTNRVRPFGPWGFMK 

NAISIAEDEEWKRIRSLLSPTFT\SGK 

LQGDGPLSLPQYGDV\LVRNLRREA 

\ETGKPVTLKDVFGAYSMDVITSSSF 

G\VSIDSLNNPQDPFVGKHQGSF*GF 

NPLDPFVLLQLKVFPFLTPILEGIKY 

SLCFPRKSY*VFLNKIC*NRLKEGRL 

KETQKHRVXDFLQLMIDSHKNSKDS 

ET\HKALSDLELMAQSIIFIFAGYETT 

SSVLSFIIYELATHPDVQQKEQNEID 

TI\LPNK\APPTYDTVLQLEYLDMVV 

NETLKLFPVAMRLERVCKKDVEIN 

GMFIPKGVGVMIPSYVLHHDPKYW 

REPEKFLPERFSKKNKDNIDPYIYTP 

FGSGPRNCIGMRFALMNMKLALIR 

VLQNFSFKPCKETQIPLEI/DAVGGL 

LLTEKPIVLKAESRDETVSGSLNFPK 

DILVCSLRKLVPQKHQRPSNYFTNR 

PLKRRRGFIPNVAAIK 


1117 


6614 


1174 


403 


1118 


6615 


1175 


465 


1119 


6616 


1176 


1112 


AGEFPGQLHSRASFCTARSGSAAAL 

RMRPVRLMKVFVTRKIPRPRCRVA 

LARAADCEVEQWDSDEPIPAKELER 

\GVAGAHGLL\CLLSDHVDKRILDA 

AGANLKWISTMSVGIDHLALDEIKN 

KRGIRVGYTPRLSLTADTTAELAVSV 

LFLPT\CGRWPEAF\REVKNGGWTS 

WKPLWLCGYGLTQSTVGIIGLGPH* 

AQAIARRLKPFGYQRFLYTGRQP\R 

PEEAA\EF\QAEFVSTPEAGWPNLILI 

VVACSLTPATNEGLCKKDFFQKMKE 

TAVFINISRGDWVNQGRPCTRALAS 

GKI\AAAGTGM*T\SPEPLPYKPPFSL 

TLERIVVILPHIG\SATPQEPGNTMFL 

VWAVNNLLDGLRGEP\MPSELKL 


1120 


6617 


1177 


518 


780 


EVLPSGPGLLALVLRLGSEFKLPHH 
EVSVGT/HPCQTSGAPARHRSTRDP 
VFPLSRGHNNPVPSWKHRAALTRH 
QTFLYCERGLPACIH 


1121 


6618 


1178 


117 


166 


ITMATGQKLMRAVRVFKFGGPEVL 
KLRSPIAVPIPKDHQVLIKVHACGW 









NPVETYIRSGTYSRKPLLPYTPGVSD 

VA\GVI*A\VGDNASAFKKGDRVFT 

RQARSSGG/YMAEYGFFAGRTTLFY 

KLPGKTGTFKQGSLPSDFPYFYCFIR 

GL\IHS\ACV\KAGESVLVHGASGGV 

GLAACQIAVRAYGLEVLGTAGTEEG 

\QKICFCKNGAHEV\FNHREVNYIDK 

IKKYVGE\KGIDIWEMLANVN\LSK 

DL\SLLSHG\GRVIVV\GSRGTIEINPR 

DTMGORSPSIIGVTLFSSAKEVEFOO 

YAAAL\QAGMGIGWGGPVIRSQYPL 

ERVAEAHEDIIHGYSGATGKMDSSL 

MDD 

1122 

6619 

A 

1179 

264 

647 

NLGTCPFPVPALQCLLLVETVSRGS 
I ! PVSPI LFOLI YPSPPAVPSYSTNSL 
LPP*PCPASLPFYDSLYI*RPAFP*SH 
PSPSTPRTEGGWPSQSHPPCCPQAP 
APPPSLPASLSQRHLLPPLSHHSC 

1123 

6620 

A 

1180 

1 

575 

NFALEAKNSARAISSVVQTPIGHSTE 

EDQGLLSTSLWGKWKCGKNAGRK 

KPLGKAPLVVL/HPWDPKGSFEQAL 

GNPVPPALCPSWGNPPKSRAHGK\K 

VI T\ST fiFMPIKHPfi*S < ?KGTFAf)PD 

VNCTCDK\LHVDPENFKLLG\NVLV 

TVLGNPIFGKEFTPEGCRASWAERW 

VTWSWPVPCSSRIPLKPLGP 

1124 

6621 

A 

1182 

265 

714 

HFTYKYTAGTTIKSKNICITPKSYSC 
TFLVINTLTTPLSNHYSGFSLLRLVLI 

AHCNLCLLD*SDSPASASQV\AGITG 
TRHQACLIFVFLVETGYPHVG*ARL 
ELLTSGDPSPPWPPKVLGLQT 

1125 

6622 

A 

1183 

84 

1009 

HSMMMKIPWGSIPVLMLLLLLGLID

ISQAQLSCTGPPAIPGIPGIPGTPGPVD 

GQPGDPRG*KERKGFQGLAGDHGE 

RGEKGRPRGFLGNNPGKKFGPKGVP 

MGPKVGPGAPGTPQAPKGDSGDYK 

ATQKIAFSATRTINVP/LLRRSQIAIRF

GPR*FTNMNT\NYE\PRSGKFTLQGC 

PGLY*FNLSTPVSRGvNLCVMLMRG 

RERAQ\KV\VTFC\DYCLTNTFPGPPP 

VfiMfiPOl KKAPKGGOCtOEKKTWF 

V VJl'IVJI V(ijl^-lvr\l XV VJ VJ VJ VJ VJ JwXViV. ± V VI 

LQA\TDKN\SLTGAWEGA\NSIFSRV 
PGFFPDMGGPDLWAGFTSTPGSPCP 
ATLTIPPTTTI 

1126

6623

A

1184

lis 

1 ID 


fiWRGLPHCVPGRNCCSVLLMGS/C 

VJ VV 1VVJ J-Jl XAVs ▼ A VJ iVl 1 Vj» V»/0 V uJUlrlvlU/ \_/ 

CL*GPHAL*KPSCSVRCCWPEAPLH 
SKTDPRLSAA*PPFC*VR*MRYGLR 
TVCAQILSV 

1127 

6624 

A 

1185 

3 

734 

GGSRERARPSPESRRLPSRRSAPHRP 

PPQPCEQDNSPRKIQFTVPLLEPHLD

PEAAEQI\RRRRPHP\AT\LVLTSD\QS 

SP\ETAEDRIPNPHLKSTLVAMSPRQR 

KKMTRITPTMKELQMMVEHSPGGN 

RQQGEEP*RGPLESTG\LQES\RPPGI

PDTEVECKCWAPFGTAKK\TAECIP\ 

KTHERGSKEPSTKEPSTH\IPPL\DSK 

GANCGSERGGGILGSRLQFGNAWT 









PDLFLIPSLLGKISCF 

1128 

6625 

A 

1186 

1 

1359 

GGPRDVNCRCRRLRAAETPPAWHL 

CMRSGAPTPPAAAMESETEPEPVTL 

LVKSPNHRHRDLELSG\DRGWSVG 

HLKAHLSRVWPERPHPEDQRLIYSG 

KLLLDHQCLRDLLPKQEKRHVXLHL 

VCNVKSP/SQKLPEINAKVA*IPQRE 

PVGSN\RGQY\PEDSSSDGFKGKGKF 

FR\NLSFPWGW\ENIFKGLEACPSRH 

FQGLG\PGFLPVYTPYGV/WLQLSW 

FQQIYARQYYMQYLAATAASGAFV 

PPPSAQEIPVVSAPAPAPI\HNQFPA\ 

ENQPANSGMLAPSSGLFNPGXANQN 

FAG*IAQGGPIVEEDDEIN\RDWLD 

WTYSAATFSVFLSILYFYSSLSRFLM

VMGATVVMYLHHVG\WFPFRPRPV 

PNFPNDC\PPPDVVNSDPH\NN\LRE 

GTD\PELKDP\NHLPP\DRDVTRMGE 

AGPGPFLYGGTAWVCSFKDFSLASS 

FFPEGPPSPSAN 

1129 

6626 

A 

1187 

314 

1614 


1130 

6627 

A 

1188 

128 

1910 

RVVDRGRRWDSPSPLLGGGTWPGR 

SSLRFASASSDSDSDSGLYRASLHPS 

PGRAALGLCLYLTKTSCCAAIGTLY 

WGNIAYKQEAYSLSGENFFMSETE 

NSCSPFMSSLLLQTEDTKKLQSKNL 

FILLIKPTNKPKMSVNVNPQRCQTQ 

F\YRLQDAPRLIAQG*GAKGNGNPR 

PVI/MSNMVDVGKGSLIGPPNVSPPN 

IPIPELGAQTQFDVKNDRYIVNG\SH 

GGELSCQDML\DGFH*KNFVLCP\EC 

GGFLETDLH\VNPK\KQTIGNS/CGK 

ACGVYRGMLDTHHKLCTFILKNPPE 

NSDSGTGKKEKEKKNRKGKDKEN 

GSVSSSETPPPPPPPNEINPPPHTMEE 

EEDDDWGEDTTEEAQRRRMDEISD 

HAKVLTLSDDLERTIEERVNTLFFLF 

LNKIKEEGVIDSSDKEIVAEAERLDV 

KAMGPLVLTEVLFNEKIREQI\KKY 

RRHFLRFCHNNKKAQRYLLHGLEC 

VVAMHQA\QLISKI\PHILKEMYDAD 

LXLEEGFFISWSEKASKKYVSKELAK 

EIRVKAEPFIKWLKEAEEESSGGEEE 

DEDENIEWYS\KAASVPNV\EFVKS 

DNKDDDIVDIDAHLKGWMQPSLTV 

1131 

6628 

A 

1189 

132 

362 

RRVDWKIQKISIGSSE*KLFNESHGI
FLGLQRIDEELTGKSRKSQLVRVSK 
NYRSVIRACMEEMHQVAIAAKDPA 
NGRQFSSQVSILSAMELIWNLCEILF 
IEVAPAGPL 

1132 

6629 

A 

1190 

97 

355 

AFSYNCPSKISCQRKSQHFHLGGLY 

VIILFLFQKGQGVCQQSHPERPEGNP 

RRHQGQKERSCVGKTLL*LPEQDFV 

PEKVSAFPPWWSLCHHPVSVPKGA 

RRLSAESSRKTGGQSETSSRTEGEK 

LRRQNASPRTYGGTRDTFPGMSVG 

H 

1133 

6630 

A 

1191 

137 

474 




1134 

6631 

A 

1192 

307 

583 


1135 

6632 

B 

1193 

1 

969 

MRPYSQTVFSTQYRWVEQHLGPQF 

VERIILTRDKTWLGDLLIDDKDTV 

RGVHKSMCPGVTAASWGRAKDAM 

SICGCWLRELWDTSSRENSAAVKT 

GREADKPEVTKQTQFSTRKDEQAC 

SGYPYSDCWLAIELLHSAEPQAPRS 

RSDANASRSGLPLRAGCETRLRLGV 

SCSACKPKPSVLRRCLLSARPPLCGP 

LHASFLPGVDSKSGKTAPRTDCTQS 

TALTGSGGAGDTLRIDEELTGKSRK 

SHTQVFICTSPLLKYHHCVGEKYR 

WVEQHLGPQFVERIILTRDKTVVLG 

DLLIDDKDTVRGWTRYPWQPSSLA 

CPD* 

1136 

6633 

A 

1194 

834 

1834 

PSWCCRAGWMKPDKMLSKEADAS 

PASAGICRDHGGPDEDNAHS*SWE 

HPDTDRAGAASGSTGTRNVERYLQ 

DSTFATSPHLAESLLKIMLGDEAALL 

EQKELLSNWYHFLVTRLLYSNPTV 

KPIDLHYYAQSSLDLFLG/E*EQPSN 

PWTTSCWQPLSLTSI/NVIKECSIALS

NWWFVAHLTDLLDHCKLLQSHNL 

YFGSNMREFLLLEYASGLFAHPSLW 

QWGVDYFDYCPELGRVSLGAAHLS 

GYL*TPRQKA\RKVLRICEAAADD* 

TSSQHL*DLSHESPSATIAWVLASF 

WSIRVKGCRLCPRSCQTGSSGITVSE 

AAFLIWISLATWGQP 

1137 

6634 

A 

1195 

32 

393 


1138 

6635 

A 

1196 

102 

888 

RNLQETAIMAEKP\KLHYFNARRRM 

ESTARWLL\AAAG\VEFEEKFIKIWQ 

KIWDKFKEMMGYFDVSQQVPN/W

*D*WGWKLVQTRAISQLTFASKLQP

SYGGKDIKGRRALDLI/DVF*EGFSQ 

ILGLKLFPPFLPVMSHPEEKRCPSLA 

LGSKRKIKKIGYLPLPFEKSLKEPMG

QDYPCWANKLEPGLDIHLWGTFFY 

YVEVELGLLGLISSFPLLEGPWKTRI 

QLTCPTVEEVSLQPRAAPREAPPRD 

EKPALEEAKEDFPGF 

1139 

6636 

A 

1199 

46 

399 

PGSKYDKTAILVSHLNFLNLLIKQP 

NPTQISPQGMFKRGGAR/PLLKTGPF 

LPTWGKDSFCYGKHSNSGWFEAKG 

FRRLPSFKNERKRNIIGSPPPRLWIS 

SESPHAPFGTKSIPED 

1140 

6637 

A 

1200 

37 

454 

PGSKYDKTVILVSHLNFLNLLIKQP

NPTQISPQGRPPPPQCRVTEWTTTAS 

TQTQAGLKLKDSDDFQVSRMKEKE 

T**GAHHQDL*LFPQKAPMPHL*QN 

PFLKTDSRRNQVRSRTFPTSITFQNV

GRSLPNTIYSGKK 

1141 

6638 

A 

1201 

298 

523 

LLKVQSRQKHWAVGLRTLVVRGA 
VLIRVPPLREPLLAPPILVGASSRENI 
SR*VTGCSPTHSSTPPLATSPRQRA 

1142 

6639 

A 

1202 

80 

589 

IFLNLLIKQPNPTQISPQGRPPPPHVQ 
GD*VYNYGPTPASQPERRALKRGI* 
RKGGEGGPRERVWAGAPPGLPTPF 









PENQGEKACG/GGRCLSPISSLQGPR 
FPADWWEIPIL\*SPDLHRDWGPAPP 
PPTSSNRPTPTVLRQERSGLG\PPSPP 
GHLGKLVFFQLAWDSFL 

1143 

6640 

A 

1203 

2 

661 

FVEADRGLAVGSLRPLNLRGARAA 

AFRFAAGRESRGKGTPGERRTRVA 

VAGKVTSVPFVVSQGREHSHAPTR 

LPTIPPAAMSAAG\ARGLRAA\YHRL 

LDKVE\LMLPEKLRP\LYNQSRQVPR 

TVFFWAPIMKWGVWCAGLA\DMA 

QTLPEKP*AQLQSACFGWLTRGLIW 

\SRILHL*IIPNTNW\SLFGW*FPFVG 

AAGASQLF\RTW\RYNPRTKKLKAH 

K 

1144 

6641 

A 

1204 

20 

400 


1145 

6642 

A 

1205 

457 

1011 

SRRPWGHFTEEDQGLLSQSLWGQ/V 

*NVEKMLGRKKPLGKAPLVVLPPW

TPRGSFEKALGNLVLLPSCPSMGNP 

QKSRAPWARRVLTFPWGDAHKSTP 

G*SSKGTFCPA*SEPA/HCDKLH/VL

DPENFAKLLGVNVLVTRFGQSIFGKE 

FHPLRCKAS\WQKMGELQVGHFPV 

LQIPLSSL\PMMQSFQG 

1146 

6643 

A 

1206 

3 

452 


1147 

6644 

A 

1207 

9 

485 

NSARATDSERTHHGARLLPDKTNV 

KAXAWGKVGAHAGEYGAEALERM 

FLSFPT\TKTYFPHFDL\SHGFCPRLK 

GPRQRRWPDALTK\AVA\HVDGHA 

QTALSGPEATLHGAQSFGVDPVQLS 

SS*SHLPCWLTLGRPPSPAEFNPCRV 

ARLPWNKFPWVSC 

1148 

6645 

A 

1208 

37 

298 

RQGLPLSPRLECHDRIIAHCNLEVLG 
SSSPPTPASPIA*\TTGVGHHTQLPFK 
LPYFFHSGLFFFLKKILYQFCDTYRA 
RISSDFCR 

1149 

6646 

A 

1209 

49 

564 

SQTPMGHFTEED\KATI\TSLWGK\V 

NVEKVSWKEKTPGKGSLWLPPWTP 

RGSRDSFGQTCPSALCPSMGKPPKS 

RAPWPRRVLTSLGRCQQSTWDDPQ 

GAPFAQA*SELHC\DKPAMWDPE\N 

FKAPGEMLLVTRFGQFHFRAKNFTP 

EGCRASWAERWVTAGVUSALVPSR 

YH 

1150 

6647 

A 

1210 

134 

673 

QRRTKATIT\SLWGKGEMWKDAGG 

E\TPGKGSLVVYP\WTQ\RFFD\SFG\ 

NLSSAFCPSMGKPPKSRLHGK\KYL 

TSLGRCP*KHL\DDLNGHLLPKPDV 

NLHODKPAMWDPGGTFKLPGENV 

AGLTRFWAIPFSGKEF/TPLEVARLP 

WQKMAEDGDWQWASCPVPPRIPL 

EAHWPMISELFKG 

1151 

6648 

C 

1211 

50 

373 

MGTVGSAWAAAPVQTHMFCSSSSC 

ISSLSTLPDSAAPLSPLASSWPVPPRP 

LLHWPFDQSSLLFSLSPSFXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX 

RALTHTSLA* 

1152 

6649 

C 

1212 

45 

424 

MNGDXGLCLAAAPDQTHMLGSSSS 









CISSLSTLPDSAAPLSPLASSWPVPP

RPLLHWPFDQSSLLFSLSPSFXXXX 

XXXXXXXXXXXXXXXXXXXXXX 

XXXXLTHTFLSLERGGVLSISIRMW 

VRLS* 

1153 

6650 

A 

1213 

97 

1382 

SACAWRLPSPGPSAMWPLWRLVSL 

LALSQALPFEQRGFWDFTLDDGPF 

MMNDEEASGADTSGVL\DPDSVTPT 

\YSAMCPF\GC\H\CHLRWQ/CAPTL 

GLKSVPKEISPDTTLLDLQNNDISEL 

RKDDFKGLQHLYALVLWNNKISKI 

HEKAFSPLRKLQKLYISQNHLVEIPP 

NLP\SSLV\ELRIHDNRIRKVPKGVFS 

GLRNMNCIE\MGGNPLENSGF\EPG\ 

AFDGLKLN/YT*RHLQRSKLTGHSP 

KYLPETLNELHLDHNKIQGHRTGR 

DLLRYSKLYRLGLGHN\QIRMIENG\ 

SLSFLPTLREV\HL\DNNKLAR\VPSG 

FP\DLKLLQWA r ILHSNNIHQSGVF 

NEFLFPWGFGGEAGPYYNGHSAFS 

TTPLPYW/ERLQPATFRWVTDRLAI 

QFGQLQKVEAAAATLVSQWGSLG 

NRARHPDGEAEPGS 

1154 

6651 

A 

1214 

938 

1501 

AGVGPDPGFLFQGIVANDSHPTALL 

KRMFASGGRRSWCQPFQGTPAMG 

GPWAKGCLGPASCAAKVGGPHPKT 

NPGPKPTGGQGFPATGLRGVGISPP 

K*PCQ*AVQPGSHCPATCAEPSPPW 

G/PGVPRFEAPPPQTPP*PRLAVPETG 

EPPLGVQQKPP/QPMPGPGAPLEN*S 

ASGGPRSPRGWESVWF 

1155 

6652 

A 

1215 

883 

1216 

YISNSQLVQHFFFFFFFLFFFFETESC 

PVAQAGTQWCDLGSLQPPPP\G*SN 

SPASASRVAGITGVHHHAGANFLYL 

*QRWGFTMLAQAGLELLTS*STRLG 

LPKCWGLQT 

1156 

6653 

A 

1216 

197 

821 

RLFHSNQTVDHSQKNVDITLKG\RP 

SNRVRAPKGTLR\RDFNPHQM*NSA 

LLGKEQQRGFRVD\KWWGYQKGN 

WPTRSGLFGSHVQD\MIKGWLPLGL 

PVTKMRVSVYAHFPHPTLLSRENGV 

SLLKSRNFLG\EKYIPQGFRMKTRVL 

LCQYLKAQKR*N*SLEGND\IGLVS\ 

NFSRLLIPASPTRLKTKGIRK\FLDGI 

FCLLEKGLFRQA 

1157 

6654 

A 

1217 

241 

514 

DGQRLGKPHFVLLVLQ*LQTGLWS 
WWGKLGVGE/MLGVGEILGSAFISP 
VLERVEGSGSGGNQAGPREEGWLG 
KPLRPEAPPSTFIQHHTLG 

1158 

6655 

A 

1218 

3 

496 

SSGLFTQQSPEAWFCVSKCSYPAW 

QNVKVIVDSHKLQEIIQRSLIPFAQF 

PAMVTVCKAPGRFCHPCWHV*P/HP 

HSHDTEHGHHHKCLVLCYPFTASA 

PPSCPAPLRCIYGCILSCLVDVPWRT 

DRLNKQRVCKEAQSRREGSPAWLI 

GQHSHRLALPLLAALS 

1159 

6656 

A 

1219 

1326 

2014 

QLMIYTFRTELAHWPGDQKHYFHT 
CVMILFFLRQSLALLPRLGVQWHDL 









GSLQPA\PPPPGFRLFSCLSLPSS*DY 

RHVPP\PQLIFFFVLLVETGF/TPMLG

RMVSIS*/PSDDPPTSASQSARITGVS 

HSWVNSCDLPLKNFFFFLEESHSVS 

QAGVQWHDLGSLQPPLPGFKLFCL 

CLLNSWDYRHAPPCPANFCIFSRDG 

VSPCWPAWSPSLHLVIRPPWPPEVL 

GLQA 

1160 

6657 

A 

1222 

1 

459 

IEIHIKCGGIPAVLAAPAMGLELFLD 

LVSQPSRAVYIFAKKNGIPLELRTV 

DLVKGQHKSKEFLQINSLGKLPTLK 

DGDFILTE/SANRSVYLQGGCWHIT 

EMRFSKCKAEGPAKSRSAILIYLSC 

KYQTPDHWYPSDLQARARVHEYL 

GWHAD 

1161 

6658 

A 

1223 

2 

310 


1162 

6659 

A 

1224 

205 

454 


1163 

6660 

A 

1225 

2 

384 


1164 

6661 

A 

1226 

1 

1276 

MPGLGFREKKGGSRTVIPASRGCGL 

PAPILCTKWELPLSGSSRCLAAAAL 

QGTVWTAESSSLTPAFQSRGWGLIP 

YFPARRDPATAAAHTALSAFTAIPA

VLAAPAMGLELFL\DLVSQPSRAVY 

IFAKKNGIPLELRTVDLVKGQHKSK 

EFLQINSLGKLPTLKDGDFILTESLAI 

LIYLSC\KYQTA\DH\WYP\SDLQGFG 

ARVHEVPWAW\HADLHPVGTFWY 

YPWGVQGVWGHSLGVQVPEEKVG 

TQTRTAMDQALQWLEDKFLGDRPF 

LAGQQVTLADLMALEGS*CKPVAL 

GYELFEGRPRL\AAWRR\RVEAFLG\ 

AELCQEAH\NIILSILEQAA\KKTLPT 

P\SP\EAYQAMLLRNRPGSPEGSGM 

GAKEISNKDSFCYLLAPFYLSLLPQS 

LLSKLQCEALHRQRHSSVLWQVLL 

LLRCKHT 

1165 

6662 

A 

1227 

151 

278 

G*KYDSDIYHEAVVPHQNQMSARS 
DKEKKRFIFYSSYYIPFC 

1166 

6663 

B 

1228 

1 

690 


MASWDEKDLTVPQPDTRKGSVLRC 

GLSSRALRWAGRGHVAAGWRPLA 

PESAGGWGMAAAMVPGRSESWER 

GEPGRPALYFCGSIRGGREDRTLYE 

RIVSRLRRFGTVLTEHVAAAELGAR 

GEEAAGGDRLIHEQDLEWLQQADV 

VVAEVTQPSLGVGYELGRAVAFNK 

RILCLFRPQSGRVLSAMIRGAADGS 

RFQVWDYEEGEVEALLDRYFEADP 

PGQVAASPDPTT* 

1167 

6664 

B 

1229 

1 

975 

MSPPGREQGLLLNLLRPSGLDNAG 

KTTILKKFNGEDIDTISPTLGFNIKTL 

EHRGFKLNIWDVGGQKSLRSYWRN 

YFESTDGLIWVVDSADRQRMQDCQ 

RELQSLLVEEVGSSYPLCTWRFFSY 

LRIEQMYNLVLYRDIQFPDFCFNSN 

TDWSKGLKTHARFGNTSLHVAHTD 

STNTTNFVDVWRGRTKSLACLLQL 

SSLTCIYTAGKMRLQDRIATFFFPKG 

MMLTTAALMLFFLHLGIFIRDVHNF 









CITYHYDHMSFHYTVVLMFSQVISI 
CWAAMGSLYAEMTENKYVCFSAL 
TILMLNGAMFFNRLSLEFLAIEYREE
HH* 

1168 

6665 

A 

1230 

1 

211 


1169 

6666 

A 

1231 

950 

1326 

RPESQRANGVDSGPNLKTVPQPDTR 

KGSVLKWISKRGKPLAVEIEESHCL\ 

CLPLRTECLGIKPMVHLFSCTRPVIV 

PSLELNYDVDSIAHMFVADLLLMIT 

LPSYDIPFYCLVFQNLLVLEFQYL 

1170 

6667 

A 

1232 

271 

927 

NQGLRHVRLCRMCLVNQMFASSIL

GKSHHHSLVPINQGHNALWKAAG\ 

PLPLKAGYC\QGFSPCHSLKYG\SW 

DEKDLTVPQRDTHKRSVNLKVELVQ 

RGKNLPVEDGGKAHCLPELPPGELE 

CPG/ILKHGLYHWSSSEMGEKPAPM 

VGG\RHV\CSNAALVIPLLPLRCLGG 

EKHKSGLVHARPVIWPSLELNHDTD 

SFAHMFFADLLLIITLPS\YYIPFC

1171 

6668 

A 

1233 

62 

1158 

GHLCARPETSLLQVRPGPLPSSFSG 

MDVGPSSLPHLGLKLLLLLLLLPLR 

GQANTGCYGIPGMPGLPGAPG\KDG 

YDGLPGPKGEPGMPAISWIRGPKGQ 

KGEPG\LPGHSWGKMGPMGPPGDC 

QGLPGPLGDFPG\EPG*G\GRYKQKF 

QFSFSRLIRQTHQPPRT\NSLIRFNAV 

L\TNPT*IYNTCTG*FTSKVPGLY\YF 

VYHASH\TANWCWLYRSG\AKVV 

TFCGPHVPKPISSNSGGCACLRLQV 

GRGRCGWLS\MTYYGHGWGIQGL* 

KSVFSGLPWLLPRTKGGARCGSRPT 

GPSPSPQLPAWTQPYWPVCILALDH

SPHQMDFSPPGSPPLTHPHCTPLPM 

GSLLPLNFFRSHCLCGSWDT 

1172 

6669 

A 

1234 

2 

907 

AVAFGAKGTDPAEARSSRGIEEAGP 

RAHGRAGREPERRRSRQQRRGGLQ 

ARRSTLLKTCARARATAPGAMKM 

VAPWTRFYSNSCCLCCHVRTGTILL 

GVWYLIINAVVLVDFIECPG*SGSSI 

TFQVLELGGDFEFMDDAN/NGAFAI 

AISLLMILICAMATYGAYKQRAAWI 

IPFFCYQIFDFAL\TC*VAISGLIYANS 

IQENIRELPPKFPYRDDAMSVNP\TC 

LVFIILLFISISLTFKGYLISCVWNCY 

RYINGRNSA*CPWFMLPAMTLRCC 

YPPYDDATVNGAAKEPPPPYRVCLS 

L 

1173 

6670 

A 

1235 

966 

1564 

NDFFKTNKSSIIRNSCLTAILVFLLCC 

YDLTLTGTLF*ILTFRWL*LGLIIRLN 

\MASRSTIYVYGSQPSFLFTFENTDF

YQLWILFLVICLPFLFKLFKTIFLWP 

KLLTLENQILHQIQCFSWFLMSAPI 

TTVRGIHVLPVIVILQTDLAWINQLL

HLLFSKLGVYDDSVQDWVSLMWN 

GTHGRKCRbQr YMPIKGHL 

1174 

6671 

A 

1236 

155 

303 

AFQEGDPMFKPSRCPCPYLPKPCPS 
TCLP/SFAFRAWTQVP*CCVCKCPG 

1175 

6672 

A 

1237 

739 

1629 

GTSQMPTSAVPSLLSPSKFARKGFS 









KVLQLCLYRRIGLCPAPPPPPVPTQC 
PRPQELIGP/WRE*CPPGSFRNSPRNL 
FSFLQELPEESV 

1176 

6673 

A 

1238 

1 

554 

NFALEAKNSARAISSLVPDA\MGHF 

TEED\KATI\TSLWGKGEMWKDAGG 

ET\LGRL\LVVYP\WT\QRFFD\SFGN 

LSSASAHPWATPKVKAHG\KKVLT\ 

SLGDA\TKHL\DDLKG\TFCPSLK*TC 

rc*QACNVGS*GTFKLPGEILLVTRF 

WQSHFRQKNFTPEGCRLSWAERW 

VTWSWPVPLFLPDYH 

1177 

6674 

A 

1239 

29 

454 


1178 

6675 

A 

1240 

1 

1368 


1179 

6676 

A 

1241 

1 

1266 

LSRVAEFEHLGWSPKPPTTCTPACQ 

GLSGAAMKSLVLLLCLAQLWGCHS 

APHGPGLIYRQPNCDDPETEEAALV 

AIDYINQNLPWGYKHTLNQIDNEVK 

VWPQQPSGRAVLRFEIRTPWGTTLP 

CCWDPTLVGQDASLEGSLKEHAVE 

GDCDFQLLKLDGKF\SVVYAKCD\S 

SQDSAEDV\RKVCQH\CPLLAPL\ND 

TRVVHAAKSCPGPPFNAQNNGFQF 

FSLEEISR\AQLVPLPPS\TYV\EFTVS 

G\TDLLFA*KKATEAAKCNLSGQKS 

NMGFCK\ATLSEKLGSGQRLQLTCT 

VF\QTQPVT\SQPNPEGANEAVPTPV\ 

VDP\DAPPSPPLGAPGLLPSWLTPKT 

TMVLL\AAPPGHQLHR\AHYDLCHT 

FMGVVS\LGSPFRRKCSHPRKNT/RT 

VVEA* WLGA AAG ATGFLPLFRGGI 

RHFKV 

1 1SO 

DO / / 

A 
A 


1 134 

1247 


f 1 Q 1 

1 1 O 1 

DO/o 

A 

A 


1 11 A 

i oil/ 

1517 

KLNMVFKKISHGMQLRKSYNLFYQ 
KSGKIMT*IWE1FFPEFFFIFPPTY\LF 
LKREFLLNEPS 

1182 

6679 

A 

1244 

213 

287 


1183 

6680 

C 

1245 

249 

323 

MYKLRRKLEDRNKIENENIVKSFR* 

1184 

6681 

A 

1246 

1021 

1985 

IAWAFKINWLPIIFLIFSVLFYPIFGFI 

FFYLLYFSNTCLS/FVFPFSYLKLLT1 

FSFSILFLSSNFLLSLYILPLAFCFLAC 

LFFFCFLLFRFKFIFFLPKSMFLSSNF 

PIF/CIFFFFAFPYSCLPVFHCQTFYLL 

QIWIL/IHSQLIFPFSSAFSKCG*QGF 

*MTQFSLRHGFQGLLITFGLISF*KIA 

FKLF*SPTTFKLNFKLFK\YLHFNLK 

ALPSLLGLNSGLL*/FLSHQFSFKYIF 

YSMKKLHLNLRHVLETVLSFPSSCY 

SSKFVHFALSFLFPSLSFFFCFLLFRF 

KFIFFL/RKSMFLSSNFPIFSILFYLPIF 

F 

1185

6682 

A 

1248 

147 

460 

PFYKNCVSIVVVVFETGSCSVTQAG 
VQWHDFSSLRPRPPG\SSDPPTSASR 
VAGTTG\MCHHIQLIF*SFFIETGTHY 

QA 

1186 

6683 

A 

1249 

168 

407 

ISHTREHSLDFFSFLFFFETESH/STR 
RLEYNGGLSAHCNLHLLGSSDLPTS 


AC*VAGTTSIWQHVQLTFVFFFFLLI 
IIF 

1187 

6684 

A 

1250 

163 

396 

TRFPILGSIA*IFFLFFFFFETESH/STR 
RLEYNGGFSAHCNLHLLGSSDLPTS 
AC*VAGTTS1WQHVQLTFVFFFFC 

1188 

6685

A 

1251 

37 

698 

RSSSPALEGGKRKVDMECLGSTIGP 

GPGSGQPACPLIWLHC*NPVEQAGP 

KP/PARPARSGP/QPPPTAGPA/PVGP 

KPPPTAGPA/PVGPKPLPTAGPARS 

WRP*PSGARGIKETHTEI*VP/PQGFF 

ACLIPTAPTGPTD* * WLHLDLPTTPV 

QPHPEAVQHTGGSCLTPYDGPPPPQ 

PISSKHPLPSYPTPSPKLPLEHPSPQIL 

GETDFSNHQILVSHSLVSV 

1189 

6686 

C 

1252 

150 

308 

MALQRTHXLLLLLLLTLLGLGLVQ 

XPMARMXCTSDSCGNTCTLRRQVA 

SDRY* 

1190 

6687 

A 

1253 

2 

470 


1191 

6688 

A 

1254 

92 

1047 

RAAWQGGAEASGPPSALRGGSALA 

AAARWGPAEEGARIPGSFFAWAAP 

AAPGAGAAAAPAPRADPGGRGLLG 

EAGRGPNYAEAGTPTLHTLPRPHLR 

PCTHTHTRTPTPPRCKAGFKRTSLR 

FLPAPHHRTREEEAGEKQNFFPLSC 

PFLRTCSEAEEPVLEEMVMGLGVLL 

LVFVLGLGLTPPTL\AQDNSR\YTHF 

LTQHHIA\KTRGR\DPLSCKTFMRSR 

G\LTSPCKDI\NTFIH\G\NKRTLKGQS 

CEN\KNGKPLTEKNLKE*SKSFLSQV 

T\TCK\LHGGSP\WPP\C\QYRATAGF 

RNVVVACENGLPVHLDQSIFRRP 

1192 

6689 

A 

1255 

199 

792 

PGSTAAADQRSRNWNPGRVRKKPD 

LEGGCGTVLSGRWRSRRNRRTSGQ 

SLVPVYIYSPEYVSMCDSLAKIPKR 

ASMVHSLIEAYALHKQMRIVKPKV 

ASMEE\MATFHTGCFICSISRRSAKR 

AMMIIRTSLEYGVRFINFPATEGIFT 

LCSS\*GGATITAAQCLIAGMCKVAI 

NWFGGWHHAKKKTCVYVALYKAF 

1193 

6690 

A 

1256 

1368 

2229 

WHPRQVLTGNDEV1GQVLSTLKSA 

DVPYTAALTAVRPSRVARDWSVEA 

GGLGRQLLQKQPVSPV1HPPESYND 

TAPRILFWAQNFSVA\YKDQWEDLT 

PLTFGVQELNLTGSFWNDSFARLS\L 

TYERIFATTA\TFSFIPAHQRYPPSAR 

HWFTMERLEVHSNGSVAYFNASQV 

TGPSIYSFHC\EYVSSLSQEG*SPRW 

ARTQPSSWQMMLAQGFPDPGFSTL 

MGEQFSYA\SDCGQASSSPGIWMGL 

LTSLFMLFIFTYGLHMILSLKTMDRF 

\DEHKGPTISLTQIV 

1194 

6691 

A 

1257 

2 

630 

PDSSGPHRLRENPPWCLSPA\DKTN 

VK/APAWGKVGAHAVRSMCAEAL 

ER\MVLFPPPTPKPYFPHF\DLSHG\S 

AQV*GPRARKVADALD/TNAVAHR 

GTD/DCPNGWPPLSDLH\AHKL\RV 

GPGSTFKLLKATC/HCLGEPWAAHL 

P\AEFQPLAVATSSLGTKFPGFLVEA 


PLLTFQITFKGWKPRVGHAFFALW 
ASPQPLLPFPAPVPPWSFE 

1195 

6692 

A 

1258 

207 

591 


1196 

6693 

A 

1259 

1 

541 

WNLSASPARQHPLLPHPVSCILLKPS 

SKKCLPNLTGYYRFVSQKNMEDYL 

QALSSPH\DISLAVRKIALLLKPDKEI 

EH\QGNHMTVRTLSTFRNYTVQF*/ 

HVGVEFEEDLRSVXDGRKCQGCPW 

HCNFSSQSHLLSSRAMFSTAYKIQL 

ALHIHGFCIHSFSKLQMEKYLGKIKI 

KNNNTK 

1197 

6694 

A 

1260 

214 

491 

ESHGAQ1YIPHFCVPWSRWGNVRR 
CEALAVIYVTNNKIYVLSDPEISCLQ 
PTSIN*QSNLLACK*GKN/RKTGQAR 
CHACNPSTLEGRSRRIN 

1198 

6695 

C 

1261 

48 

173 

MVRKRMEMKMRKLSQLRASGQLK 
MMRMXMSIPRSRRPRGIX* 

1199 

6696 

A 

1262 

59 

306 

FGTDRTAVQTSSSQRLCLPWVAQK 
TYWLLVPSSLLKDLKEKKEVVEEA* 
NGRDAPANGNAVSVCFAPEPWQLP 
PHKIFPVLL 

1200 

6697 

A 

1263 

279 

889 

TLAVFL1PCIGSPACPTMSDAAWDT 

SSEITTKD\LKEKKEVVEEA\ENGRD 

APANGNAVEEEDGDEDEEAESATG 

KAGQPED\DEDDDVDYQGKQKTDE 

D*IRQQKRKKLNLKKKRPAHRGQC 

HPADDTRSPPPNPNHENLQQGREKR 

NQNFQGPAFFLKSTLKRKFVCIFYL 

HFIFLYILLRVSHFLMISDDQTSLPER 

SLSYF 

1201 

6698 

A 

1264 

358 

508 

DDDVDTKKQKTDEDD*TAKKRKV 

KLKKKKAAVTYSPSTSRLRYLYVFT 

FE 

1202 

6699 

B 

1265 

46 

386 

X1RHESGSRSHSHCSTLSSIGDVAKK 

LGEMWNNTAADDKQPYEKKAAKL 

KEKYEKDIAAYRAKGKPDAAKKG 

VVKAEKSKKKKEEEEDEEDEEDEE 

EEEDXEDDDEEEDDDDE* 

1203 

6700 

A 

1266 

263 

484 


1204 

6701 

A 

1267 

157 

886 

TWGKGDLKKPRANMSSYAFFVQTC 

RGG\HKKKHPDASVNFS/ESFSKKCS 

ERWKTMSA*R/EKGKFEDMAKA\D 

KARYER\EMKTYIPPQRGRQKRKFK 

DSQLHPRGPPSGLLSSSCSEYRPKIK\ 

GEHP\GL\SIGDVAKKLGRDVGINTA 

AD\DKQPY\ERRAAKLKEKYEKDIA 

AYRAKGKPDAAKKGWVKAEKSKK 

KKEEEEGEEDEEDEEDEEIEE\DEED 

EEDEEEDDGLMNKLGSGAVFFFSCL 

1205 

6702 

A 

1270 

2 

343 


1206 

6703 

A 

1271 

3 

647 


1207 

6704 

A 

1272 

59 

1297 

NHASVQVKLWILSRSYLQLTMEAN 

GL*PQGLPDLKNDTFL*AAWGEETD 

YTPVWCMRQAGRYLPEFRETRAAQ 

DFFSTCRSPEACCELTLQPLRRFPLD 

AAIIFSDILVVPQMFSPPPGTGQWKV 

TMVPGKGPSFPEPLREEQDLERL\RD 


P\EVVASE\LGYGFQAIT\LTRQ\RLA 

GTCAG*LAFAGAPMEP*MTYM\VE 

G\GGSST\MAQA\KRWLYQRPQ\ASH 

\QLLRILTO/ALWLPYLV\GQVVAGA 

\QALQLFES\HAGHFWPHSSFNKFCT 

ALTSRDVAQAK*RPRLPGSQALATR 

CPCIIFA\RDGHFAPGRSLAQAG\YE\ 

WGLDWTVAPKVKARECVGKTAVTx 

LQGNL\DPC\ALYA\SEEEIGQ\LVKQ 

MLDDFGPHRYMTPHLGHGLYPDMD 

PEHVGAFVDAVHKHSRLLRQN 

1208 

6705 

A 

1273 

7 

1047 


1209 

6706 

A 

1274 

7 

960 


1210 

6707 

A 

1275 

3 

53 


1211 

6708 

A 

1276 

237 

441 

AHFLHIVLVLLSCWYSVNRSRCTSQ 
QGVQSDMAQLLPLRQRLEGEALV 
ALGAGVERRHVRPGPRNST 

1212 

6709 

A 

1277 

94 

1003 

VRVSLSLRLECNGAILPHCNLHLPG 

SSDSPASASQVAGITGVCHHAPPIFV 

LLVETGFHHVGQAGLELLTSSDLPA 

WASQSAGITGGSH/LCLANVKYFKS 

HFKIFVIDETWFQHT*LLSLSSCD*V 

*TSI*/PQVLD*NVLCPDV/SQ*LLPW 

L1KIFLKYTLDCW*KRQAWLYWY* 

FGTELFGCP*GRQTLCFFFLRRSL/DS 

VAQAGVQ*CDLGSLQPLPPRFK*FS 

CLSFPNSWDYRCAPPLPANFCIFSRD 

/MGFAMLVRLVSELLTSGDPPASAS 

QSAGITGMSYRAWPKPYAFEVECR 

PCDN 

1213 

6710 

A 

1278 

4077 

5102 

KEASPAKRASGEGSRRLRVEAGGR 

CGKVCSGRGGGSPELRLRRQKMLR 

ASSQRNAAGHRGWRASGSRGSPTA 

AAERPKKGGGGSRAAQTASSSGTR 

RRRRRRLRESRRPRSRSGCRPPAFQ 

AAPDPPLPLPAGSHFRQATAELA/G 

RAPRRKWPKPAFAS/VGRGRGRAL 

ATFPTASED/SPRPPAAPR*HPGQGR 

GAGRRGLWEARGGAPAAFGAPQL 

ASCKGRRA/HTPSTINLFLNDPPPLP 

KH\PH*ASPPTLGIEFQHEIWRGQTS 

KLSQHPSFSLRTLLTYSSTQAAFEFL 

FCRCLPTGHVPSSLLHSAADTAVSG 

DYATEGWECHCCWGCWEAKVGV 

LLH 

1214 

6711 

A 

1279 

316 

1180 

KEASPAKRASGEGSRRLRVEAGGR 

CGKVCSGRGGGSPELRLRRQKMLR 

ASSQRNAAGHRGWRASGSRGSPTA 

AAERPKKGGGGSRAAQTASSSGTR 

RRRRRRLRESRRPRSRSGCRPPAFQ 

AAPDPPLPLPAGSHFRQATAELA/G 

RAPRRKWPKPAFAS/VGRGRGRAL 

ATFPTASED/SPRPPAAPR*HPGQGR 

GAGRRGLWEARGGAPAAFGAPQL 

ASTQAAFEFLFCRCLPTGHVPSS/rP 

PFSSRYSSFRRLCLIFRGCWTEVLNE 

GGANSDSWSLTKLESGSSECSS 

1215 

6712 

A 

1280 

2 

315 



1216 

6713 

A 

1281 

195 

739 

KFSSRITVCHWKDVLLSGFQEPDRG 

SAMDYVRK\YAAIFL\VTLSVF\L\HV 

LHSAPDVQDCP\EFTL\RENPFLSPAR 

VPPILQCMGLPASPRANYPTPTKGPR 

KDGCWVPKEPSPSESPPCLW*PK1HI 

NRGPQ*MGGFPKVGEPTRACPLGS 

YLFIYPQIFKMFLPQVLSWMDWLDF 

LGIGKI 

1217 

6714 

B 

1282 

29 

200 

MSRTRLVCPSLIPFCIYVVDVGFSPG 
PQSCTSHEPKDIHAKCELAFLHHQR 
FYKNEGX* 

1218 

6715 

A 

1283 

138 

1908 

ASRTAVARWECVLQNVRREPSPSR 

AWPSQLRPIASTATKCREVCGPGYST 

PLEAMKGPREEI\VYL\PC1LPETQGT 

EGPRLSWATVDVDPKSPQYCQVIHX 

RLPMPN\LKDELHHSGW\NTC\GSCF 

G *LAPSRGTK\LVLPSFHLLFGI YVG 

GTWGSEPRAPKLAQGSLSPRDIHAK 

C\NWAFLHTSHCLASGEVMISSLGD 

VKGNGKGGFVLLDGETFEVKGTWE 

RPGGAAPLGYDFWYQPRHNVMIST 

EWAAPNVLRDGFNPADVEAGLYGS 

HLYVWDWQRIEMVQTLSLKDGLIPL 

EIRFLHNPDRCPKAFVG\CALQAPNI 

QRFLQRTRGGTLFSGRR*FQV\PPRK 

LKGW\LLPKMPGL\ITTILASPWNDG 

F\LYFS\NWLAWGP*GKYDISDPQRP 

ALTGQLFLGGSIVKEGPWQVLEDEE 

L/TSPSPEPLWKGKRVGEGP\QMI\Q 

LSLDGKRLNNHHGRCTSALGQSSF 

YP*SQSGERLLVNAGRVEW*DNSK 

KGGA*KLNPQLSWVDFGEGAPLPK 

PLPH*ARYP\GG\DCSS\DIWILNSPPS 

HPHSLFWALHFPGGPGLSFCISLGTR 

TLGKHVPTTAKLRLWQCVES 

1219 

6716 

A 

1284 

155 

336 

HFKHNRGWAPWFMPVIPALWEAE 

GG\HLKL**AMIVPLHSSLGDTVRLL 

KYLCIYSLIF 

1220 

6717 

B 

1285 

251 

570 

XELLVQLASLQTSFVTLDEAIKITNR 

RVNAIEHVIIPRIERTLAYIITELDER 

EREEFYRLKKIQEKKKILKEKSEKD 

LEQRRAAGEVLEPANLLAEEKDED 

LLFE* 

1221 

6718 

A 

1286 

83 

977 

HNQLTPVEEDTVESQFWSYCSLGCP 

AGDPSRKVIVRMSGKDRIEIFPSRM 

AQTIMKARLKGAQTGRNL/L*RKKS 

DALTSFDFRQ1LKKVIGDLKCLMG\E 

VMREAAFFTSWKPSFTAGDFQALT 

VIPKCSIKGGKLKVSE/DEGKVIVRP 

VFTFASNFEHYH*RN*PVMELTGLA 

RGGE\QLAKLKRNYAKAVELLVEL 

ASLQTSFVTLDEAIKITQQAC*MPIE 

HVIIP\RIERTLAYYSSQSWVRRERE 

EFYRLKKIQEKKKILKEKSEKDLEQ 

RRAAG\RCWS\LLIFLAEEKDRGSFL 

F 

1222 

6719 

A 

1287 

1 

3249 


1223 

6720 

A 

1288 

504 

932 

RYRCGVGSVLQAGGLILHLRKEGIL 


HNKGKIEGMGLLEYVQGSLLQGW 

VMFVSVTAFFFSLLFLGMFLSGMV 

AQIDANWNFLDFAYHFTVFVFYFG 

AFF\MEAAATSLHDLH*NTTITGQPL 

LRDNQYNINVAASIFAFYDDS 

1224 

6721 

A 

1289 

66 

1363 

RTAVMPREDRATWKSNYFLKIIQLL 

DDYPKCFIVGADNVGSKQMQQIRM 

SLRGKAVVLMGKNTMHAQAPFEG 

TL*NNPSLWRKLLP\HIRGEFGLLFH 

PGRTLTEI\RDMFAGPIRLPTAARCW 

CQLPPCEVTVPA\QNTGLG\PEKTSF 

FPGL*VSPTK\ISQGAPIENPEVNVPA 

ESRTGDKSGKPSE\ATLLVNML\NISP 

FSLWGWSSSQVFD\NASIYKP*KCLI 

SPEETLHSR\FLGGCPQMLPSVCLQ\I 

GYPTVASVPHSIINGYKRVLAL\SVE 

TDYTFPLAEKVKAFL\ADPSAFVAA 

APC/AVVAPPAAPAAAAAPAKVEA 

KEESEESDEDMGFAAAVLPGGTKP 

GALPPLAPGLALPRGPRCDPEAFRK 

RFRSQPRQDGGGHAELQKFTLPSLY 

FFSPKPSGCEPRANAKSSLNFVFSKY 

SLSTYYEQGL 

1225 

6722 

A 

1290 

3 

231 

CSSSTSPSSEYCPSWGTEFRSGEMGS 
K*SSL*GASSSLSPDTGRGTGSSSSSS 
GGGGPGGVLGPLGCGLGGLPLN 

1226 

6723 

A 

1291 

221 

984 

ETGLMCSSPLDGQNSV*RLTPWTLA 

PGTTAEVNQEDQKKSQIL* KK WQA 

AANSSRLVKNS*MP\T*KIQNGRMD 

KNSPTKYLLSSRDSSYTHKESHKLKD 

IALILPSSSEGS1SELEQLSNSLPNKEL 

MTSICDCLLATLANSESSYNCLLTC 

VRTMMFLAEHDYGLFHLKSSLRKN 

SSALHSLLKRVVSTFSKDTGELASSF 

LEFMRQILNSDTIAPVCVAALSVSM 

CSHRSAPTWFPPKGLKGARTTSRNS 

SEA 

1227 

6724 

A 

1292 

817 

1826 

SCYPFCTS*KSFHSFCHTNDTVQSFK 

SI*LKR\SDT*SANRSSNNPLITPACSS 

PVTP*APHSASEDTNLVPALSCRTFK 

AIGWRSATCKFKACMVQSTNWPRE 

SCFVFFNCK*LNPFIIKLGISSKAVTH 

FAPLSSLYKSKSSVNFTEASIAPEFSL 

SGP*SFILLISISKWLYGQLKYKSGT 

SSLSYGSYVIGGTEAKSSAVYSTSKF 

GYLNVSRSKSAIPSSSLEICS*PSSPSP 

SSSSSPSSSSSSSSSSGILSTVRRSSSS 

SSSSSSTSTSSSSSSPSSCCSSSTSPSS 

EYCPSWGTEFRSGEMGSK*SSL*GA 

SSSLSPDTGRGTGSSSSSSGGGGPGR 

SSWSPWLWSWRASIKL1FFLPISMF 

WDSFSNLGCGLGEGGWRWEGGGG 

GRVSVMTLSLYPLDPYIASVRQLYQ 

PLSTRPSVFTFELGLKMMEVFSSYS 

RLPNLSKTGALGLPYIVEE 

1228 

6725 

A 

1293 

25 

771 

LDGPQGTSRPWGLPSLPPPRAGAGL 

SQGFGSSLRPARTPPSSGSKMSTEA 

QRVDDSPSTRA\QSSDGDQRESVQQ 


EPEREQVQPKKKEGKISSKTAAKLS 
TSAKRIQKELAEITLDPPPNL*VAGP 
KGDNIYEWRSTILGPPGSVYEGGVF 
FLDITFSPDYPFKPPKWTFRTRIYHC 
NINIQGVICLDILKDNWSPALTISKV 
LLSICSLLTDCNPADPLVGSIATQYM 
TNRAEHDRM\ARQW\TKRYAT 


1229 


6726 


1294 


449 


826 


1230 


6727 


QKSRRLIDDLFFFFFCETGSCSVTQA 
GVQWHNHGSLQPPPPGSVNDPPTSA 
SQVAGVSVACHHSRFVCARVCVCV 
CVCVCVCVESGLKLLGSSNSPSLAS 
QNARIIGMS*IVPKKLYFKSKVKGRI 


1295 


727 


NTEDQRNEEKAQREANKK1EKQLQ 

KDKQVYRATHRLLLLGAGESGKSTI 

VKQMRILHVNGFNGDEKATKVQDI 

KNNLKEAIETIVAAMSNLVPPVELA 

NPENQFRVDHILSVMNVPDFDFPPE 

FYEHAKALWEDEGVRACYERSNEY 

QLIDCAQYFLDKTOVIKQADYVPSD 

QDLLR\CRVLT\SGIF\ETKFQV\DK\V 

NFHIV*RGVGQRDERRKWIQCFND 

VTAIIFVVASSSYNMVIREDN 


1231 


6728 


1296 


1271 


PVRSSAPRRGHSVASAPRSGLRQVA 

GRRGAALPCSLAPGCGAAAGASPC 

PGAGRRRAAGGRCLACECTSLTCA 

GESGKSTIVKQMRILHVNGFNGEGG 

EEDPQAARSNSDGEKATKVQDIKN 

NLKEAIETIVAAMSNLVPPVELANP 

ENQFRVDYILSVMNVPDFDFPPEFY 

EHAKALWEDEGVRACYERSNEYQL 

IDCAQYFLDKIDVIKQADYVPSDQD 

LLRCRVLTSGIFETKFQVDKVNFHM 

FDVGGQRDERRKWIQCFNDVTAIIF 

VVASSSYNMVIREDNQTNRLQEAL 

NLFKSIWNNRWLR\TISV1LFLNKQ\ 

DLLAEKVLAGKSKIE\DYFPE\FAR\Y 

TTPE\DATPEP\GEDP\R*TRAK\YFIR\ 

DEFLRISTASGDGGHYCYPHFTCAV 

DTENIRRVFNDCRDIIQRMHLRQYE 

LL 


1232 


6729 


1297 


235 


1571 


GRPRPPPPQPGRAPPPPPPRMGCLG 

NSKTEDQRNEEKAQREANKKIEKQ 

LQKDKQVYRATHRLLLLGAGESGK 

STIVKQMRILHVNGFNGEGGEEDPQ 

AARSNSDGSEKATKVQD1KNNLKE 

AIETIVAAMSNLVPPVELANPENQF 

RVDYILSVMNVPDFDFPPEFYEHAK 

VLWEDEGVRACY\ERSN\EYQLIDC 

AQYFL\DK\IDVIKQAD\YVPSDQDL 

LR/CARVLTS\GIF\ETKFQVDKVNF\ 

HMF\DVGGQ\RDERRKWIQCF\NDV 

T\AIIFVVGSSSYN\MVIREDTGHNGL 

AGRL*TSPKGIWDNRWAAAPSLVIL 

FLTKQ/EILLA*ESPLAGNSK\IKDYF 

PE\FAR\YTTPEGCYSRRPGEGPHGV 

YRGQVTPFEDEFLRSSNCPVEDGRH 

YCYPHF\TCAVDTENI\RRVFNGLAV 

DIHFSGMHLSFSYGAGFKEGEPPKF 


NLKA 

1233 

6730 

A 

1298 

176 

505 

1LKFPWID1FLYSFNLVFLIFETRPHFI 

PTSAS*VAGTTGTQHHAWLI*LFGE 
RELHHVTQAGLKLLS\*VILSPQPPK 
VLGLQA 

1234 

6731 

A 

1299 

310 

326 

SQHFGKLRRVHSLNSGVQDQPGQH 

LRRLGWEDHLNLGGRGCNEPRWC 
HCTPA*VTE*DSISKNNNNN*QAGV 
QWHHLGSLQPLPPRFK 

1235 

6732 

B 

1300 

99 

430 

XV1TQRELVSRQVSNDLTEQAATFG 

1 IT r\rw/ci TVI T'ET"' T/"T7"C*TT7 A \7C A vr\ 

LlLUUVbLl YLlrOKErlbAVEAKQ 
VAQQEAERARFVKEKAEQQKKAE 
QQKKVEQQKKAAVISAEGDSKATE 
LIANSLAHRGGPP* 

1236 

6733 

A 

1301 

64 

967 

NFRVEAGVRGVQQKETCAFKVLESI 

GKLG\LALSVAGGAENSALYNVDA 

GHRAVIFDPIPGQK*QDIVVGEGTHF 

\LIP\WVQKPQLSNDCRSRPRNCCQS 

ITGSK\DLQNV\NITLRHPSSGPVRQP 

SFPRIFTSIGEDWDERVLAVPSQLEN 

LKSVVAPFDAG\ELITQR\ELVSRQV\ 

SDDL\TERA\ATFGLILDDVSLTHLTF 

GKL\LTEAV\EAKQVAQQGKQRRAR 

r V/ VLbKAbQQKJCAAIISAbGDSJCA 

A CI f ATvTCT ATA/^TVST TT7T DVT C A A'r? 

ADblAPobA J AUJJubibJ^KJsJLbAAb 
V 

1237 

6734 

A 

1302 

424 

598 


1238 

6735 

A 

1303 

3 

371 


1239 

6736 

A 

1304 

1 

1596 


1240 

6737 

A 

1305 

2 

556 

WDMMYVTRFASFLRNVLPSFISDW 

LYVQKMNTWFKHENYGLMPLNGY 

LKMEIFFIQKRGALI**IYLSIKPSVK 

EFTETSAVFEDGTMFEAIDSVIFATG 

YDYSYPFLDET1MKSRNNEVTLFKG 

IFPPLMEKPTLAVIGLVQSLGAAIPT 

ADLQAWWAAKVFASRWAILSFIHFI 

NEHLLNTCY 

1241

6738

A 
A 

l j\jO 


1 1 C7 

I lo/ 

Ir r rrr rKMbbCPJr AQAG VQ WCDLG 

CI OAT PP/^ITTPCC/^f Of T CCU/TIVDDD 

PPHLANFLYF* *TWVFTVLARMVSI 
S 

1242 

6739 

A 

1307 

6345 

9041 



O /HU 

A 

A 

iDKJO 

OTA 

Zoo 

All 

\ 

LL 1 LKW&Hasn VLLK. 1 KuQrRRSG 
WCL*SH/HFGRPRRADHLRSGVQDQ 
PGQHSEIVSTKNTKIGWAWWQVPV 
IPAT*E 


fJ!A\ 
0 /*H 

c> 

1 J\)y 


All 

433 

MnLULrLNKlLFl^IRUVbsECjSSRL 
HPLPDPRGDRHRMADNLPMEIHGS 
SATSSGKPSDFNQAAVDGAGERGG 

KC V LOO 1 L.U V Lr* 

1245 

6742 

C

1310 

11 


MVSTOLROASDPRTTinRFRPFT T 1 

RRVDKLMSPRLPTGTLNPHHFWTL 

SIPQVGRCNAP* 

1246 

6743 

A 

1311 

225 

735 

GELRVNSLHVSTHFQ1PEETDIGWL 


VSPGQGPARPFEDIQLWPPGSLMAA 
EPTDQSLEESH*DRWITTFFTFAR\Q 

LKDVGSLDRKN*GAWDVNQDFGG 
SRFN\EY\WRLIGGAWPKEIRKEEKT 
LKIQERSKAAWLEDGVGQGRT 

1247 

6744 

A 

1312 

165 

629 

TGEMLIRKHCLY1VCKLSLLFIFRDM 

SCSVAQ\AGMQWHATNAVDGD*LT 

ASLKLLGSSNPPPCNPKRAGIIGMH 

l^HK^LlrUWorr'Kv^ArLKI 1 KALKN 

FKKI*FQNTLVTGKY*HCSLTFFFCH 

FNENR1LHLLAQMAKQDSGLLRLV 

A WE A 

1248 

6745 

A 

1313 

76 

523 

ELGRNLWNASQGRGLEWVSNRKF 
WAVYAYITFLSHPERVVNKHNCPL 
FE* KGEC WEW*EARAQT\DRVQKQ 
MWLLRIQrSFSSRKETQSLRVTFWR 
i rvirarJoMK'MorUN&i K1LYK.R 
NVPLKAHFPTAVLAVVPPAVTNQG 
KEQG 

1249 

6746 

A 

1314 

55 

391 


1250 

6747 

A 

1315 

996 

1334 

WASVGLSGPRSPPSSRPQ*ARPRPG 

APAbLRQADLGRGWRDRLGAPRPR 
rPK 1 GO WKSCGRGRGPGSRPRGAR 
AGLGPGAPGGWRRSRRSWTRARA 
ATRPRAAARGSRTPRG 

1251

6748 

A 

1316 

1 

993 


1252 

6749 

A 

1317 

72 

496 

PPWARGSARRPPAWRTVRMPSCHP 
RMFGAPQKTFLRVSVWSRCRPWGI 
VMRMM*PMRGQVRRHNSCMAPKT 
EE*NPTVSATFCCCSFVSCSWPPVTR 

VCCTT UTA A\A 

Y£>MLr I A AM 

I ZD J 



1 J 15 


i/o 

A/fTDVI TT T?T CDT DDI/ z^ 1 r?! 11//^ T T I PI t* 

M 1 r YL I LrLbrLPPKGEIWGLLLFLT 

PLGFLLPSLPLLLPCPAPAGVRRQW 

DGPTEGA* 

1254 

6751 

A 

1319 

1 

1541 


1255 

6752 

A 

1320 

9 

345 

YLSEVGVSVGIVIRPRQWIRRPEGDP 
FHGGRLKMDPLRAQQLAAELEVEM 
MA\DMYNRMTSACHRKCVPPPF\KE 
AELSKGESVCLD\RCVSKYL\DIHGA 
*WGKKFDRVVLLQG

1256 

6753 

A 

1321 

199 

985 

VRGSGADPGGRLCSASVRRGGPLE 

GAFNSRTRQATMTENSTSAPAAK\P 

KRAK\ASK\KSTDHPQVFQT*IVACN 

SSPRRTGAGS\SRQSIQ\KYIKSHYKV 

G*ERLTSQI\KLSIKRL\VTTGVPSRQ 

TK\GVGASGSFRL\AKSDEPKKSVAF 

KKTKKEIKKVATPKKASKPMCKAAS 

I/* A "DXTTrnXTTlV CO TIT TTSTS A \ T/WT A A T»/~» 

KAPN PRNPKoSP VKKAxKKKLAAPG 
FKivA\iNK.r Jv I V lw\KPGKAr IvAHWR 
PY L V* PKCKCP V AKEGRARRKLTMI 
VFFLRDTPSWSPIFCK 

1257 

6754 

A 

1322 

3 

129 


1258


A 

A. 


j /o 


lNlL»rirvJL#r l rlxclv 1 o Y /VJrr V VKivrrlolv 
GRHTPFPALKWEWVWRKKGPGEG 
DSDMALSYPSPPPPPPPPMVMG*/SP 
PSH*PLLLCRRVKKEGF 


1259 

6756 

A 

1324 

14 

262 

FQTQTKYQAPLHPAPGHRPTPLPPP 
PPASHCHLFLWSVCKRSNKPQAPFI 
SP\PPSNPLPEPGPCSSLCQPWGRTG 
EGLAPPD 

1260 

6757 

A 

1325 

234 

275 


1261 

6758 

A 

1326 

52 

454

SQTQREPTMVLSPADKTNVKAA/W

GMFLSFPTTKTYFPHFDLSHGSAQV

KGHGKKVADALTNAVAHVDDMPN

ALSALSDLHAHKLRVDPVNFKLLS

HCLLVTLAAHLPAEFTPAVHASLDK

FLASVSTVLTSKYR

1262 

6759 

A 

1327 

3 

616 

PTLLVPTDSERTHPWLLSPADKDQR 

QGPAWG*G*GSHPPSNVAKTLER\M 

VLFPPPTPKPYFP\HFDL\SHG\SAQV\ 

KGHGKVKVADALTNAVAHVNDDMP 

NALSA\LSDLHAHKL\RV\DPFNFKL 

L\SHCLLG*PWAAHLPRPSFTPCGCK 

ASL\DKFPGLFVEAPLLEPSKLPLKL 

GSLRLAMLLCPFGPFPQPLLPFPAPV 

PPWSLK 

1263 

6760 

A 

1333 

732 

1634 

RRFWQVENHEILTEQAFVGQKPIFR/ 

MKSLKEKLKTLATKRKPKN1GDILT 

KRKNNRTGSEAGEPQRWKPRRCPI 

RRAQEKNSQRPRKRREGKVPIWRK 

KPLKTGRDRSW*P*KLLALCPSTVG 

IRSRAG 

1264 

6761 

A 

1334 

1 

531 

FFFLHVVAHLIFTATQMGSNILITVL 

QMRSPRPRRVEYLAQDHTPFATPHS 

KSHRQLRLSTSMMVTYHLPLQTISS 

LVPYSQWGHLQSPGPKLGNRKPFL 

TPPSPSPTPSKWNP\SPGTMYFFRVLI 

QSPLSSPRPRC*PIVRCPAAAPGSSLP 

STLFAGPSPPFPASYTLRLRCAF 

1265 

6762 

A 

1335 

130 

361 

GACCPLSPGGESGT*C/SPRGPRKPS 
GPSPKAKCS*QSTGCGQCASSPPSPS 
PGRAPCDSPILACDLGQGGFLQLHT 
H 

1266 

6763 

A 

1336 

144 

224 


1267 

6764 

A 

1337 

1111 

1200 

PGWTLGGGACSEPRCATALQPGR 

ESETLP*TLHSHNQQSKPLPPICYPIS 

VNGNFILLVAQVKKPCKAGCGGPR 

L* SQHFGRPRQADHLRPEVQDQTG 

QHGKTASL/LKNTKISQVS/WTLGG 

GACSEPRCATALQPGRESETLP 

1268 

6765 

A 

1338 

344 

512 

ETLYTME*YTAIEKNEIMSFAGTWL 

ELEA\VIL\SNLV*EQKTKMLPCSPL 

MGWELK 

1269 

6766 

A 

1339 

274 

393 


1270 

6767 

A 

1340 

1 

561 

VRSAVGGTGMSSGASRKSWDPGKP 

WPPDWP1TGRK\MK\VLWAALLVHI 

SCQNPMPKW*QAV*NKSRSPELR\Q 

QTEWQ\SGQ\RWEL\ALGRF\WDYL 

RWVQTLSEQVQEELL\SSQVTQ\ELR 

ALM\DETMKE\LKAYKSELEEQLTP 

V\AEETRAR\LVK\EPQAAQARLGAD 

MEDVRDRLVHYMFDV*AII 

1271 

6768 

A 

1341 

1 

746 

MAAAGAFRLRRAASALLLRSPRLP 


ARSCRPRPDSITRSPPDVRLPLEKQL 

KNAINQRGTKGPYIRYYPEVVDHY 

ENPRNVGSLDKTSKNVGT\GLVGAP 

ACGDVMKLQI\QVG*KRGRFVGC*G 

FKTFSAVGSAIXASSSLSHLNGVKGK 

T\VEEALTIKNTD\IA\KELCLSFPWK 

LALAPMLGLKVAFKAALADYKIET 

RTQKKGEAEKKWSPPLGEASSRPTP 

AVPQPAVPVTLDVSGSRLPSPTEGA 

L 

1272 

6769 

A 

1342 

160 

381 

SWGLDSALGVSDTTHHPKVVGRAK 
HRGFLKWLPPPPGPPLGKRNCQPYP 
NP\QLPESGPPQQYFYLKCCPFYEL 

1273 

6770 

A 

1343 

653 

814 

WDSTPLPSSFTDWMLWPEKRQVGA 
CSHPTPNFL/C*GKLNISPNSPCQPKV 
LDLT 

1274 

6771 

A 

1344 

954 

1245 

DLSLTTSRQRKHEFKEVILPNYTHD 
KLYL1NSNHPTKMCP*NQVLKKLIIK 
ITQQWNTGAMIHAYNPSTLGGRWG 
WIT*SQELETTLPHIRGTLIHKI 

1275 

6772 

C 

1345 

155 

431 

MRERGLGPLGVFRSDAGLKSLPEG 
RRESGALEECVIHTQRVCDWKVPE 
HPLAPLLLVSVAVLRILAKPLPTQR 
DLFLLLSRSSLFNQHFPLC* 

1276 

6773 

A 

1346 

2 

70 


1277 

6774 

A 

1347 

1 

271 

LFF*TESCSVTRLQCSGMISAHCNLH 
LPGSSNSPASASRVAGTTGARHHAQ 
LIF/VVFLVETGFHHVGQDGLDLL/N 
LVICPPRPPKVL*LQA 

1278 

6775 

A 

1348 

117 

229 

KKTMFRQKLFYKRKQLQKGPRP*G 
SAKKKLCFGKSYF1RGSSFRKGQGH 
EERGLKYKKKTGEGV 

1279 

6776 

A 

1349 

255 

381 

IHFSINSQLFSPSSPPNKKG*FPKVT* 
KR/P*NGPRYIPQRFP 

1280 

6777 

A 

1350 

72 

577 

TTGVRGALQREGGSLPTQPQGERA 

LGGRRRNQPAGDPGFA\G*RGA*WP 

TGCKQGRGGSPSPQPLGSGGR\QNL 

ARLKPPRPPP/PQGEGDKGRPRQQG 

PSWRSKSSPPPILPTSYPGGDKGWP 

KHKLEAVNSVRARSSVSRNKCWSN 

RMDIALRCPVTVAHELTYSDAL 

1281 

6778 

A 

1351 

160 

336 


1282 

6779 

A 

1352 

1625 

1920 

LSFPAKKGNDCLPHVPGLSFKGRVP 
ESRQEFCETQESSLENPRLCIPV/CHP 
WAPWRWEPGETAEPPVLRRKEAF* 
PGPPCPPPNRVSETREAVLLQPRI 

1283 

6780 

A 

1353 

9 

249 

KVHTKPRTSSICRSHIGFCLFF*ETES 
CSVTPPGVQWCNLG\SLQTPVPPG\SS 
DSPASAS*VAGTTGMHHRAQLII*\F 
YVFTGFFWQ*QLSDGILTH 

1284 

6781 

A 

1354 

426 

744 

DNLLKSFHLIIS1NCLFFCFFFL*TESC 

SV\SRLKCSGAILAHCNLCLLGSSDS 

HISASLVAYRCLPPCQANCFFSFLAE 

TGFHHVVGQAGLELKIQKLAECSG 

MRQ 

1285 

6782 

A 

1355 

184 

422 

GY1QGSPALVSTHGGHAV*PPTQSK 
PTEHAHPVLQPRRWARTRVGEGIPC 


/ 


AHPSPSCSPRPLATQKRFQANRR.GR

LEGLC 

1286 

6783 

A 

1356 

3 

1221 

HFIYLFIFIFLRQSLALPLRPECSGAIS 

AHY/RHLCLPGSSDPPN\SASRVPGT 

TGMPRCTWLIFVFLVEMGFHHVGQ 

AGLKLLTSGDPPALASQNGGITSVS 

HRVQPKNFFE*SRYYYLNFKGN*DS 

KRLKK*KIRQLEN/RRNMIQILLLFF* 

L*NSCSRMKDNS*AREKYLQHILRG 

LRFLTCKDSTKLDLKATNRNTTLE* 

MENMMKQFTEEKIQMPNKDLKRC* 

TSLGVREMQIEVIRYYFSRSS*QKFK 

RVV/IIPGQLQRKCQF/PCTW*KSKPL 

KRFSEEI*YNINQIHLFEAGSCCVVQ 

DRVQ*YHHSSL*PQPPRIKQSFHLSL 

PTSWDYWHMPP/HPS*FCVCLFCFF 

SRDEVSPGCSSWS*TPELKLSSHLNL 

PKCRDARPEPPCVAQYDINQNCKK 

HVLS 

1287 

6784 

A 

1357 

703 

918 

TREVEVAVSRDHATALQPGQQRQD 

SVSKKKKKGRVQWLAPVIPSTLGD 

QGGWIT*GQEFETSLTNKVKPHLY 

1288 

6785 

A 

1358 

273 

561 

DRQAEKKRNIL* *NEIVLDLPR*PHT 
AS*ISSTT**FENTALRPGVVAHACN 
PSTLGGGA/GQIT*GQEF\ETSLANV 
VKPISTKNIKINWEWWCRL 

1289 

6786 

A 

1359 

375 

663 

LITKQLGLGVVAHVCIPQHFGRPRW 
ADHLSP\GVQDQPGQ\QGETQSPQK 
NTKISQVWWHVPEVLAP*KAEAKG 
LLEPGRQRLPPTLHSRLGGSETL 

1290 

6787 

C 

1363 

79 

153 

MPDMIRQHSSCLIKRVYLTRWFGR* 

1291 

6788 

A 

1364 

4776 

9134 

WLHDSVN1LKNIEHFFFFFFWRQSL 

APSSRLECSGAISAHCNLHLPGSSDS 

PASASQVAGITGAHHHAQLIFVFLV 

ETSSHHVGQASLKLLTSSDPLALAS 

QN AG ITGMSHFJTWPNH*IEHF* WV 

NCMVWEHYLNKAILKFYKEIKSRR 

DPTPKAYLIWGR*C*/TVL*NKEQVT 

DTKFLT*LFKGVGRM*KFTVPAPEV 

VRVS* L YLA*DKCLVI1VAMLMDRN 

VIFLKGPLMGFVG*LCSGFQGKEWF 

LFESYPSLPTSPTTLSPALPKYFL*VN 

VTKANNFLQ*SIPFYIMVLTFPST*S 

QYTNFFFFSF/HFFFFFF/CFFETESHS 

VTQAGVQW*DLGSL*PLPPRFQ*FS 

CLSLLSSWDYRHAPLRPANFCIF**R 

/SGVSPCWPGWS*TPDLK 

1292 

6789 

A 

1365 

3 

287 


1293 

6790 

A 

1366 

75 

301 

AHKKLRAAPLAPMTGTH*K\VFVK 
AGDKVKAGN\PSWVMFPMKMEHTI 
KSPKDGTVKKVFYREGAQANRHHS 
FSRV 

1294 

6791 

C 

1367 

155 

377 

MKASSFKKLQKFYIDPYKLLPLQRF 
LPRPPGEKGPPRGGGRGGRGGGRG 
GGGRGGGRGGGFXXXXXXXXXGF 
RG* 

1295 

6792 

A 

1368 

363 

423 

S1KGTENGFLIVL*PKSQV 


1296 

6793 

A 

1369 

157 

723 

KTLWRYGNRG/GFQRGGST/RAGGC 
/GAQGQGPEGHVAPY*GE\FLHPCE 
DDIVCKCTTDENKVP\YFNAPVY*E 
TKEQMGKVDAIFGQLRDFYFSVKW 

OCM1V4D1 cccwi ^rtvirvDwi I T»f 
ocNMKL/Obrls.ls.LV;lS.r YlJJryjtLLrL 

QRFLP\RSPGEKGPPK\SGGRGGRGG 

GRGGGGRGGGRGGGFRGGRGR\EG 

GGFRGGRGGGFRGRRTLSETC 

1297 

6794 

A 

1370 

1 

810 

ASPPLVCTHARSFSVLAPLFLHFPLL 

PDRRSRSFRAVHFWGPRSSPPHAAV 

RLREARRGRDRREKAESPTGEKSTS 

PSSSRRQRGPPTKVRRPPAPFTMQP 

ASAKWYXDRRDYVLKWEFVFLNTL 

RDV\NVNF\EKSKTYNFSCLGG\SDN 

F*GILNEIDLFH\CID\PNDSKHKRT\D 

RS1L\CCLRKRRIWASSWPKGLTK\E 

RGKGLI WV\S VDFN\N WuCDWEDDS 

UJbUMoINr UKr bt\MMNNMljuU\bV 

VDYPEVDG\ADDDSQDSDDEKMPD 

LE 

1298 

6795 

A 

1371 

1 

588 


1299 

6796 

A 

1372 

179 

989 

KWRNQSWRWPKPGTNWMLSCSVC 
WRRVTWTGSVWMRKLGKHPQTPT
/IKDCSIAATGKRPSARFPHQRRKKR 
REMDDGLAEGGPQRSNTYVIKLFD 
RSVDLAQFSENTPLYPICRAWMRNS 
PSVRDAECSPSS\PLPPLPEDEEG\SE 
VTNSKSR*CVQACPPTHTPGGQPKN 
ACR\SR1P\SPLAALRMQGTP*RWSPF 
EPEPSPSTLIYRNMQRWKRIRQR\W 
RPPACLPLVUPFA 1 AESPLCLSRWK 
EASHRNQLRYSESMKILREMYERQ 

1300 

6797 

A 

1373 

245 

336 

HIPSQGQDGK*VKNRARKIVSYSRG 
GWHSG 

1301 

6798 

A 

1374 

127 

872 

EATGQTVGTVPSSLSRPRPPLHSSSG 

GVRIQTLFATSRPLDKTASIFLVLSN 

ACIF*KILIA*KEIQDS*SHH\*K*LLLI 

RLDFLSSFFPP*L1N*FFFFPGNSLVH 

RLFFFFNLGMVAHTYNPSTLGGQSG 

RIA*A*EFKASLGNLVGPHFFFLGGG 

VOYKSPFJKPJLNKKKTKSLQKKKR 

VDVD1DT 1 IJA/IA/DVT? 17 VXTl/'.OTYrvTD /''XTJ 

Y KKKlrJLJLHM VrxsJsJiJvNJ^uJJNKyrl 

AfVMK' A QPQ VPFMVO'MTQlf/TVTVI V 
/\\{K{l1SJ\aro i rrlN V V^lN 1 oiSAJ i 1 I JLK. 

1302 

6799 

A 

1375 

37 

80 


1303 

6800 

A 

1376 

1452 

1700 



OoUI 

A 

A 

1 i II 

ilo 

mi 

373 




A 

A 

\i la 

144 

356 

r QFKL i bLKINb 1 DVHRCPQLPPPPPXS 

PPPYPGPWGMGCGRFPKGQQGPRP 

PRLSGLG*FLSFFALGFSDG 

1306 

6803 

A 

1379 

860 

924 


1307 

6804 

A 

1380 

263 

616 

TTCFSFESRASCHFHVASAVSPPTPL 
CSPATLMAQDKAGKPSPRK/PSLAP 
JDNJr LijrLror JTCJtlor Jvr JtwYJnn. V cor 
SGPSPTRAKKRV*FSSPPSLWGQEPR 
GHAECPLPYLGEGAPA 

1308 

6805 

A 

1381 

152 

272 

YSVLKFNYEILKNCFSFFV*VSEPFF 


KT1ILVYFTLTEKL 

1309 

6806 

C 

1382 

71 

316 

MCHHSQLTFLYFVKTKSHCVAQAG 
LELLASSDPLDSTSQSVG1TGMSHH 
TQPL1LFFNFSVSVKYTK1MVLKNGF 
RNSYKK* 

1310 

6807 

B 

1383 

165 

520 

XNLKLLDNWDSVTSTFSKLREQLG 

PVTQEFWDNLEKETEGLRQEMSKD 

LEEVKAKVQPYLDDFQKKWQEEM 

ELYRQKVEPLRAELQEGARQKLLP 

VLESFKVSFLSALEEYTKKLNTQ* 

1311 

6808 

A 

1384 

3 

558 


1312 

6809 

A 

1385 

3 

118 


1313 

6810 

B 

1386 

28 

384 

MKAAVLTLAVLFLTGSQARHFWQ 

QDEPPQSPWDRVKDLATVYVDVLK 

DSGKDSVTSTFSKLREQLGPVTQEF 

WDNLEKETEGLRQEMSKDLEEVKA 

KVQPYLDDFQKKWQEEMELYRQK 
* 

1314 

6811 

A 

1387 

2 

1093 

GGASCCLPRSLWLPSSRFRPCPRPG 

LWVPEVFSRSVPFSSPGCNEWGSTG 

LLHAEGTPLSQALLLLQVPHGPFRM 

KAAVLTLAVLFSDG*ARRRHFWQG 

G*SPPRAAWDRV\K\DLATRVPWTV 

LKEQRTETYVSQFEG\SALGK\QLNL 

KAPLTTGDSVDLPPFS\KLREQF\GP 

C*PRDFLGINLGKRETEGP*GKGR*G 

KDLWKEVKAKVAALTLDDFQERS 

WQEEIGAFTRQKV\EPLARKNFQEG\ 

ARPESLHELARRSLSPLGEEMRDRA 

RA\HVDALRT\HLAPYSDELRQRLG 

AR\LGALRENGGARMGQYHA\QAT 

EHLSTLSEKAKPALEDLRQGLLPVL 

ESFKVSFLSALEEYTKKLNTQ 

1315 

6812 

A 

1388 

1 

2076 


1316 

6813 

B 

1389 

560 

752 

XSVAAVTALNSESNFARAYAQGISR 
TKYWELIYEDSMDLIAKLPCVAAKI 
YRNLYREGSVTRGH* 

1317 

6814 

A 

1391 

1031 

1407 

CVGGRAQVEKEGAALRLRPATVPR 

ALMSLSSLVKPETA/ALCGTEMQDF 

FTPLLCDFKS\PGISFCKGT*MCLRSC 

*HNVRRWR/DQPTPVSTVTPVPTLT

CVPSPIIFPVPER\GAGCLHLCLKFM 

VH 

1318 

6815 

B 

1392 

97 

880 

MAPRTVLLLLSAALALTETWAGSH 

SMRYFYTSVSRPGRGEPRFISVGYV 

DDTQFVRFDSDAASPREEPRAPWIE 

QEGPEYWDRNTQICKTNTQTDRES 

LRTAARLLQPERGRTMYGCDVGPD 

GRLLRGHNQFAYDGKDYIALNEDL 

SSWTAADTAAQITQRKWEAARVAE 

QLRTYLEGTCVEWLRRYLENGKET 

LQRADPPKTHVTHHPISDHEATLRC 

WALGFYPAGDHTDLAAGMPRTKL 

RNTELVETRPAGR* 

1319 

6816 

A 

1401 

1380 

2180 

EIQSEWNLQDLXELQLEERLAGAWE 
E/LSFRAVRMPSP\FRSSALMG\MCG 
SRSADN\LSWPFFH*NVMEPVTELM 


QEQSYLKSELGLGLGEMGFEIPPGE 

SS\ESV\FSRAQQDSFCECSGPSNATR 

KMGVPS\TASVGKSKTPLVARKKVF 

RASVALTPTAPSR\TGSVQTP\PDL\E 

SSE\EVDAA\EGSPRKL*GPKSESEW 

EKEGHGKLPS\MPAAEEMHKN\VEA 

RLSLQQVIR\EIKESIVGEIR\REIGKL 

DFWAAVSS\SKASNSR\QDYH 

1320 

6817 

A 

1402 

119 

301 

INDKRKKKRPARPGAGGLHLQLCL 

SQPPQPRGHPAPIPTGQAGPRDSGP 

GASP*/GRDPPSD*WTPADLGSDPW

AGPLPTPQEP*GSRWPSSATVSLSAS 

TATGTPCTYSHGTGWTQRLWTRGL 

PLSRDPPSD 

1321 

6818 

A 

1403 

1451 

2495 

RGLAGNFEDRKSAHYVFQTFRGGE 

RRSLELEAHLEGWSLGLRFLLGPLK 

GPPA\QGHFHPSLPISSWRGAGVPHS 

R/SPFPTLG1PG*1FPPKPGRRPRGPPR 

KEDLGPGMVG/RPSGPLPQLPSAVL 

SADPAGPRPHVPFCEP/SPSHGVRAS 

PGSKWVEEEIGGEEGRQ/PKCRQAF 

QEAWLMQG/GARGQGLPGS/GCWR 

INKPSKPSKRGGKGLTCQTFSTNIC* 

SPPLMPRSLP\GPSFILHLISSQQP*SG 

LLFIDPIPPEKGRGGLSERWGRAFG 

DSVACSFQKPTPGPWEVFEQDAWP 

NPWP/QGPPPENFPKGNPSHSRNIHK 

GDEQSPVRTKTEPTPWGGKHSQFA 

SR 

1322 

6819 

A 

1404 

3222 

5798 

PLLTPLVSKVTAAGVPLFFFFFFFF* 

DIVSLCHPGWSAVV*P*LTAASNS*\ 

VKQSSHLSLPSSWDNRYAPPRPANY 

FYYFYFL*RLDLALFPKLLLNCWAQ 

VILPSQPPKVLGL*AQSSEGGIHSGL 

SLPSPCFLLCNPI 

1323 


6820 

A 

1405 

38 

402 


1324 

6821 

A 

1406 

2 

380 


1325 

6822 

A 

1407 

1 

477 


1326 

6823 

A 

1408 

1 

1104 


1327 

6824 

A 

1409 

524 

1584 

IVKMEKYSIMKSMNMHLTERKKDH 

FRNDTNTQSFYREKWIYVHKESTKE 

RHGYCTLGEAFNRLDFSSAIQDIRTF 

\NYVVKLLQLIAKSQLTSLSGVA\QK 

NYFN1LDK1VQKVLDDLH1SLLFKDL 

PQVLSSNLCT\LIRGVGKSVLVGNIN 

IWICRLETILAW\QQQLQGLQMD*G 

K*TMGLTL\SDLPLAHGWNNILLPV 

FQDGWGHSFTFRPR*PPRLYMAVG 

EDRQLW\KKL\CQYHF\AEKQFCR\H 

LILSEKGHS/VEWEVGCNFATFRKH 

YPAKEQYGRQHCIFCRHCSILFWKD 

SGHP\CTAADPDSCFTP\VSSQQFIAL 

FQVLRACPLPIPYWRFVNPCCPVQG 

¥ i\ rot 

LIVSVL 

1328 

6825 

A 

1411 

588 

855 

VLLSSYLTQYSLVFICWLFICLFICIFT 

FMYVTM*IWFAVVFVWNLVFPYKV 

VK/TPWRSRIHVHCLYF**NYSPYAF 









LYSA*CYSLCVH 

1329 

6826 

A 

1415 

277 

509 

YMLSCHHTFVQTHRMYNTKHEP*C 
KLWTLC*DNLSM*VHQL*KFPTLVG 
DVDS/ERRLCMYGDRGYMGNLYIL 
LSILL 

1330 

6827 

A 

1417 

15717 

16041 


1331 

6828 

A 

1418 

41 

544 

TKLVMMQKLLKCSRLVLALALILV 

LESSVQGYP\TRKPRHQWVRCNPDS 

SSAHCLEEKGHMFELLPGESNKIPR 

L\RTDLFPKTRIQ\DLN\RIFPLSEDYS 

GSGFGSGSGSG\SGS\GSWFLTGNGN 

RNYQL\VDE\SDAFQ*QPLGSLDRNL 

P\SDSQDLGQHGLEEDSMV 

1332 

6829 

A 

1419 

168 

467 


1333 

6830 

A 

1420 

2 

196 

ASTRSRRSGSRGLTRRAAFGVRAGE 

GWVCGGPAGSRRRRKLPLTGPGSG 

SFQCRSRGGRGSVNMGKGDPNKPR 

GKMSSYAFF\VQTCREEHKKK\HPD 

FSVNF\AEF\SKKCSER\WKTMSCKG 

RSSKF*RIWAKS*QSFAIDR\EMENL 

RFLPKGDKKGKKKDPN\AP\KRPPS 

AFFLFCSEHRPKIKMEHPGLSIGDTA 

KKLGEMWSEQSAKDKQPYEQKAA 

KLK\EKYE\KDIAAYRAKGKSEAGK 

KGP\SRPTG\SKKKNPPRSWRGWEG 

*EA**DPASGIWGAGRRGLGLWRA

GGQSEEAETAPDRARFWEFSMSVT 

RWTRICQHG 

1334 

6831 

A 

1421 

3 

107 


1335 

6832 

A 

1422 

450 

851 

KTEFTQNYFWKSCTGVDGFFFSILF 

CLFV*DGVLLCHPGWECSGCDLHS 

LATSASQVQAILVPHQPSK*AWDYR 

RAPPQLG*YLYC\CRDRVFTMFVPG 

LSNFW\PHVI\HLPQ\PPKVLRIIGREP 

TCAPASMAF 

1336 

6833 

A 

1423 

6 

461 

AEMTPLHSSVGSRVRLHLKKKNYI 

KKPRKALFISIFGIIFFHLIYVSTYIYL 

STFSFTSLILMEFILCVV*VLHLQLK 

NAILMAYWY/TFILITWLPSSEEDLK 

VL*LFMPKNELIFSKCFHFLFIVPKFT 

LLDHLAFLRLKLAFWRLGWHST 

1337 

6834 

A 

1424 

222 

350 

GAHTWTGISQAALQPTAPGRLSRTL 
LFLC*KCGEICNLLSGC 

1338 

6835 

A 

1425 

198 

385 


1339 

6836 

C 

1426 

207 

385 

MFFYKLAQMXQISVSTLKFKLVLFV 
PTGXQVNGGEPSTLYGRWGRXGSA 
PSWRYQPFCP* 

1340 

6837 

A 

1427 

327 

516 


1341 

6838 

A 

1428 

58 

255 

FSPDFYRGYIYFYHYFGFEKFFFTPS 
NFFISQSR*FFCVKMFSFFNLRFKIPL 
PNHVDFALCFFVV 

1342 

6839 

B 

1429

242 

509 

MRPRKAFLLLLLLGLVQLLAVAGA 
EGPDEDSSNRENAIEDEEEEEEEDD 
DXGENDLEVKEENGVLVLNDANFD 
NFVADKDMDTARDLPPX* 

1343 

6840 

A 

1430 

338 

511 

NSFSKSKTPCVAHGRGVHAEAGNK 
RQYLGFLFFFF*FFILWCFTSCSEY 


1344

1345

6841

6842

1431

1432

671

454

955


FFFF*IFTLGCFTSCSEY*ITMNDVK*
FSPEFLPEGYLLFLSLFGV*KIFFYTL 
LISLFLKAD/RFFCVKMFSFFNLRFKI 
PLPNHADFALCFFVV 


1346 


6843 


B 


1433 


46 


3152 


MRPRKAFLLLLLLGLVQLLAVAGA 

EGPDEDSSNRENAIEDEEEEEEEDD 

DEEEDDLEVKEENGVLVLNDANFD 

NFVADKDTVLLEFYAPWCGHCKQF 

APEYEKIANILKDKDPPIPVAKIDAT 

SASVLASRFDVSGYPTIKILKKGQA 

VDYEGSRTQEEIVAKVREVSQPDW 

TPPPEVTLVLTKENFDEVVNDADIIL 

VEFYAPWCGHCKKLAPEYEKAAKE 

LSKRSPPIPLAKVDATAETDLAKRF 

DVSGYPTLKIFRKGRPYDYNGPREK 

YGIVDYMIEQSGPPSKEILTLKQVQE 

FLKDGDDVIIIGVFKGESDPAYQQY 

QDAANNLREDYKFHHTFSTEIAKFL 

KVSQGQLVVMQPEKFQSKYEPRSH 

MMDVQGSTQDSAIKDFVLKYALPL 

VGHRKVSNDAKRYTRRPLVWYYS 

VDFSFDYRAATQFWRSKVLEVAKD 

FPEYTFAIADEEDYAGEVKDLGLSE 

SGEDVNAAILDESGKKFAMEPEEFD 

SDTLREFVTAFKKGKLKPVIKSQPV 

PKNNKGPVKVWGKTFDSIVMDPK 

KDVLIEFYAPWCGHCKQLEPVYNS

LAKKYKGQKGLVIAKMDATANDV 

PSDRYKVEGFPTIYFAPSGDKKNPV 

KFEGGDRDLEHLSKFIEEHATKLSR 

TKEELMDVQGSTQDSAIKDFVLKY 

ALPLVGHRKVSNDAKRYTRRPLVV 

VYYSVDFSFDYRAATQFWRSKVLE 

VAKDFPEYTFAIADEEDYAGEVKD 

LGLSESGEDVNAAILDESGKKFAME

PEEFDSDTLREFVTAFKKGKLKPVI 

KSQPVPKNNKGPVKVVVGKTFDSI 

VMDPKKDVLIEFYAPWCGHCKQLE 

PVYNSLAKKYKGQKGLVIAKMDAT 

ANDVPSDRYKVEGFPTIYFAPSGDK 

KNPVKFEGGDRDLEHLSKFIEEHAT

KLSRTKEEL* 


1347 


6844 


1434 


785 


1271 


LCTDQLHNFNNYFQDKDKCFYFPM 
FWSFLGLETEAACFKPDSKGKALQ 
NRKYFNWYLPSATSRDLWISPGWS 
QPFFFFFFFFFFFFF*RA 


1348 


6845 


1446 


549 


791 


GLLSN*NFFFSILIFFFQTESRSVA\RL 

ECNGAISAHCKLRLPGSRHSPASAS 

RVAGTTGAHHHAWLIFFVFLVETG 

FHHVSQDGLDLL/NLVIHLPRPPKVL 

G*QAGVQWCDLRSLQAPPPGFTPFS 

CLSLPSSWDYRCPPPCLANFFCIFSR 

DRVSPC 


1349 


6846 


1447 


59 


485 


NSPCSGSS1ATASPERRKGINPAPPST 
PAAPCRS*ACTAAAAAAVR\DDRLN 
VTEELTSNDKTRILNVQSRLTDAKR 
INWRTVLSGGSLYIEIPGGALPEGSK 









DSFAVLLEFAEEQLRADHVFICFHK 
NREDRAALLRTFSFL 

1350 

6847 

A 

1448 

125 

927 

NPPCSGSSIGPCSPKRRKG1KPGPPIH 

RKPAPSGS*\ACTAAAAA\AVRVPGS 

PSTAVWTRVRGVLGGAPDAPHPPLK 

DPRVGEGNSQRNPNL\SANLFYS\DD 

RLNEQKEUTSHDKDEHSRRPSSRLT 

DGKTPLNWRNNC*SGGSLFNIKPAG 

GRRWPEGSKGQLLQFLLEFA*GATC 

GADPKVFICFQTRTREKAEPPLLRVT 

FSFFGLCRLVRPGGIPLVPKETPDAC 

FMALQRSRESLPGEEEEVGARLRG 

WAIPPPWGHPLVCTGG 

1351 

6848 

A 

1449 

1 

866 

ESVDLAAEAVRIRRSPLIFSKAVHIM 

AAAFRKAAKSRQREHRERSQPGFR 

KHLGLLEKKKDYKLRADDYRKKT 

RITSKALRKXKALEKVNPDEFYYKMT 

RVKLQGWSYILLRETKGRK*PQEQL 

KADEELQDVKYIEMKRVAEAKKIE 

RLKSELHL/VWDFQGKQQNKHVFF 

FDT\KKEVEQFDV\ATHLQTAPELV 

DRVFNRPRIETLQKEKVKGVTANQT 

GLKRIAKERQKQYNCPAHQRIENRE 

KKLIPLLPQKIQTRKDLMDKTQKVK 

\VKKETVNSP\AIYKFQKSVENR 

1352 

6849 

A 

1450 

3 

896 

LRAVRVGLLLGGGGVYGS\RFR\FTF 

PGCRA\LSPWRVRVQR\RRCEMSTM 

FA\DTLLIVFISVCTALLAEG\mWVL 

\VYRTDK\YTRL*AEVEKQSKKLGK 

GRKETI\TEFSWFGQQKKKIERQEEE 

T*RNNNRDLS\MVR\MKSMFA\IGFC 

FTAL\MGMFNSIFDGRVVAKASF*P 

LFSYIQ\GLSH\RNLLG\DDTHRLVPF 

IF\LYILCTMSIR\QNIQKI\LGLAP\SR 

A\ATKQGRVDFLGPPPPSGEVLLEL 

KELFIFYSFFLGHTHIRMGQLFCSQE 

PIGSLYYLGLFLVLNYFLSLLGYD 

1353 

6850 

A 

1451 

2 

125 


1354 

6851 

A 

1452 

18 

1374 

LAEQIVPRGVGIRPPDKADQAPCRS

PIRTPAPESWHCDSRQRFRQDSSRM 

KMRVLGLVVCLVLWTLHSEGSGG 

KLTAVDPETNMNVSEIISYWGFPSE 

EYLVETEDGY1LCLNRIPHGRKNHS\ 

DKGPKPVVF\LQHGLLADSSNWVT\ 

NLGNSRLGFILADAAIDVWMGNTR 

GNTWSPKHKTLSVSQDEFWAFSYD 

EMAKYDLPASINFL\LNKTG\QEQV 

YYVGHSQGTTIGFIA\FSQMLELAK 

GLKMFFAWGPVASVAFCTSPMAKL 

GRLPDHLIKDLRGDEEFLPQSAFWK 

VAGVPHLATHVIL\KELCGNLCFLL 

CGFNERNLNMSRVDVYTTHSPAGT 

FVQNM*HWSQAVKFQKFQAFDWG 

SSAKNYFHYNQSYPPTYNVKDMLV 

pta v+vmnunuin Trrv\/v/^ , \/xTT*T Tr\ 
r 1 A V \1 vjvJJrlU WL»JbL/V I Vj VINl^JL* 1 Vt? 

mNLVFHESIPEWEHLDFIWGLDAP 
WRLYNK1INLMRKYQ 

1355 

6852 

A 

1453 

165 

1353 

LPKPRLGPGQPEKDRTESSVRMAIT 









LEEAPWLGWLLVKALMRFAFMVV 

NNLVAIPSYICYVIILQPLRVLDSKR 

FWYIEGIMYKWLLGMVASWGWYA 

GYTEM\EWGEDIRAV\SKDEAVTLV 

NHQATGDVCTLMMCLQDKGLWA 

HMMWLMDHIFKYTNFGIVSLVHGD 

FFIRQGRSYRDQQLLLLKKHLENNY 

RSRDRKWIVLFPEGGFLRKRRE\TSQ 

AFAKKNKLPFNLTNVTLPRSGPTKIIL 

NALV\AQQ\KNGSPAGEDAKELASK 

SKGLQWIIDTR*PYPKAEPI/DNIQT 

W\VFGYRKPTVTHV\HYGIFP\IKDV 

P\I FTPni \TTWT 'MORFVFK'Pni T 

FYETGAFPPXSKGHKEAVSREMTLS 
NLWIFLIQSLAFLSGYMWYNIIQYF 
YHCLF 

1356 

6853 

A 

1454 

313 

650 

FVICV*TYTGMNTHSPTHTKTLSFLS 
n^r5*FPVPR^i /P'Min*OK'TVPnpwp 

LJ\j\j rr x v^ivoJU/v^iNivj v^Jtv i irA^riNis. 
LHTHTHTHTHTHRGYFSHTSSTVEK 
ALLTRIPEGSEEEDNGFYGWALRTI 
LVAFKSQCQMH 

1357 

6854 

A 

1455 

2 

401 

VSVGGLVGEVACACRDCIPETMAE 
GDNRSTNLLAAETASLEEQLQGWG 

C V iViJUlVi/\L/rv V Lt\. W CKA WfrrAl/ MO 

IIYYLDPSVLSGVSCFVMFLCLADY 

LVPILAPRIFGSNKWTTEQQQRFHEI 

CSNLVKTRRRA 

1358 

6855 

A 

1456 

18 

741 

AACGAFSRWVGVRVSVGGLVGEV 

ACACRDCIPETMAEGDNRSTNLLA 

AETASLEEQLQGWGEEMLMADKV 

LPWERAWVPPAIMGVVSLVFLIIYY 

LDPSVLSGVSCFVMFLCLADYLVPM 

LAPRIFG\SNKWTT\EQQQRFHEICS 

PKMYFMTMIVSLAA/VLLAWGQQV 
HNLLLTYLIVTSLLLLPWT*TQHGIH 
FERTLGMANMEINKLLKHK**TN

1359 

6856 

A 

1457 

2 

529 

GRVDPKAKKEAPAPP\KAEAKAKA 
LKA/RRKAVLKGVHSVHKKKKIPHV 
HPPF\RRPGRHLR\LRRQPK\YPPEEP 
RPRRNKLDHYA\IIKFP\LTTESAMK 

ICIFUTMNTT VRIVfiCI K'PTK'Hn\I*n 
rvlCM-'lNl^ 1 Li V r U V VJVsJurwi 1 rvxiv^u v< 

GC*RKLFWTLDVGAKVNTPGFGPD 
GRRRKAYVPTWLPDLPIAFGMFAN 
KIWGFI 

1360 

6857

A 


J AD 


VARVFVOHROFI FRGT P1AATMHM 

KKSPEIISGRMTFWQPRNWLLFACH 

ATNEVAQLIQGGRLIKHEMTKTASA 

1361 

6858 

A 

1460 

438 

549 


1362 

6859 

A 

1461 

210 

556 

QFWGPVATWGLPIDAINDMKKSPEI 
inwr?n*T4PAT nrvoi tpmpii pw 

VXQPSGTWASCFACHGTNEVRPSFI 
QGEGRLIPNTRMDLKRASGINQWG 
KGKNKVFEGDSLCPGCC 

1363 

6860 

C 

1462 

110 

509 

MLLICSFAPATLXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX









XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXX

XXXXXXXXXXXXXXXXXXVSA* 

1364 

6861 

A 

1463 

93 

180 


1365 

6862 

C 

1464 

128 

382 

MYLGISRRLSSMLTFLAYLHPRERP 
PHRAPXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXQQWAQG 
TGISIPRTCTSTGL* 

1366 

6863 

A 

1465 

3 

140 


1367 

6864 

A 

1466 

1 

609 


1368 

6865 

B 

1467 

1 

690 

MASWDEKDLTVPQPDTRKGSVLRC 

GLSSRALRWAGRGHVAAGWRPLA 

PESAGGWGMAAAMVPGRSESWER 

GEPGRPALYFCGSIRGGREDRTLYE 

RJVSRLRRFGTVLTEHVAAAELGAR 

GEEAAGGDRLIHEQDLEWLQQADV 

WAEVTQPSLGVGYELGRAVAFNK 

RILCLFRPQSGRVLSAMIRGAADGS 

RFQVWDYEEGEVEALLDRYFEADP 

PGQVAASPDPTT* 

1369 

6866 

B 

1468 

1 

975 

MSPPGREQGLLLNLLRPSGLDNAG 

KTTILKKFNGEDIDTISPTLGFNIKTL 

EHRGFKLNIWDVGGQKSLRSYWRN 

YFESTDGLIWVVDSADRQRMQDCQ 

RELQSLLVEEVGSSYPLCTWRFFSY 

LRIEQMYNLVLYRDIQFPDFCFNSN 

TDWSKGLKTHARFGNTSLHVAHTD 

STNTTNFVDVWRGRTKSLACLLQL 

SSLTC1YTAGKMRLQDRIATFFFPKG 

MMLTTAALMLFFLHLGIFIRDVHNF 

CITYHYDHMSFHYTVVLMFSQVISI 

CWAAMGSLYAEMTENKYVCFSAL 

TILMLNGAMFFNRLSLEFLA1EYREE 

HH* 

1370 

6867 

A 

1469 

25 

353 

EVCYYRSSEAFFSEL1KVILRHLCSV 

AGKGLCSIPQLNTREGSVLRRISKR 

GSPLAVEIEEGHCLNCLPLGTECLGI 

KPUVHLLNSEIGEKPPFSPLSPCSSA 

AFLLLR 

1371 

6868 

A 

1470 

79 

467 

RPESQRANGVDSGPNLKTVPQPDTR 
KGSVLKWISKRGKPLAVEIEESHCLA
CLPLRTECLGIKFJVHLFSCTRPVIV 
PSLELHYDIDS1AHMFVADLLLIITLL 
SYYIPFYLGFQNAGITGINHRAWFY 

1372 

6869 

A 

1471 

368 

611 

LCPSHFAPTTLTQSGSSLKTCWLNS 
RFKACRAVPGPCLVNQMFASSILGV 
KSHHHSLVPINQGHNALWKAAG\PL 
PLKAGY 

1373 

6870 

A 

1472 

441 

1178 

FVALPQPLCPSHFDPTTLIQSGAHKN 

MCCIKSRFKRDLGLCRTCLVNKMF 

TSSILGKSHCHSLVSINQGHNAPWK 

AAG\PLPFRAGYC\QGFSPCDSLKYG 

\SWDEKDLTVPQPDTRKGSVLRWIS 

QRGKPLAVEMEEGHCLACLPLGTEC 

LGIK/PPIVHLFNSEIGE\NRP\MVGG\ 

RHVLQ*CCLG*FL*LPLRCLG\GEKH 

KSGL/HVHIPVIVLS\LELNYDIDSFA 









HMFF/SVDLLLIITLLSYYIPFC 

1374 

6871 

A 

1473 

1540 

1812 

GKFQLIKTLQNPVVCSLPVIPALWG 
GQRWVDHLRLGVRD*PGQHGETPS 
LLKNNNNNTKISWAWWHEPV\IPA\ 
MGEAEAGESLEP\GRRRLQ 

1375 

6872 

A 

1477 

1 

354 


1376 

6873 

A 

1478 

1 

411 


1377 

6874 

A 

1479 

2 

265 

RMYGKIIFVLLLSGIRNVHPK*IVSIS 
ASSTTGVAMHTSTSSSVTKSYISSQT 
NGITLINWWAMXARVIFEVMLVW 
GMIILISYCIR 

1378 

6875 

A 

1480 

111 

520 


1379 

6876 

A 

1481 

106 

395 

EEALPPPLHCTWVPFPSFECLQEMS 
KEIVSISALSTTEVAMHTSTSSSVTK 
SYIS\SQTNGRKRDNLSDRFHCTSSC
SDNTSLFLCVMGWYYWERSS 

1380 

6877 

A 

1483 

3 

1078 

TRAAGLRAGVRVPRSPGPSRRMPA 

RSGAQFCRRMGQKKQRPARAGQP 

HSSSDAAQ\APAEQPHSSS\NPAQAP 

CPRERCL\GPPTTPGPYR\SIYFSSPK 

GHLTRLGV\EFFDQPAVPLARAFLG 

QVLVRRLPNNGTELRGPHRWETEA/ 

YTLGPEDXEAAPLQGGWPGKTPRVN 

RGMFH*KPGD/LWVVYIIYGMYFC 

MNISSQGDGA\CVFL\RALEAPGKS 

WRPMRQLRS\TLR\KGTRQARVLKG 

PPKLCSGP\SKL\CQA\LPINKSF*PEG 

TLAQDEAVWLERGPLEPSEPAWG 

S/APRVGVGHAGEWARK\PLRFYV\ 

RGSPWASVVDRVAEQDTQACAKG

LPRQDFLHCLKTRINVLFLEKKKKK 

1381 

6878 

A 

1484 

3 

452 


1382 

6879 

A 

1485 

26 

493 

NSTDSERTHPWLLSPARQRPTSRPA 

WGKVGAHAVRSMCAEALERMFLS 

FPT\TKTYFPHFDL\SHGFCPG*RATG 

KKVDDSADQTPWPTWDDMPKRRC 

PP*SDLHAHKLS/RLDPWNFKAPKA

TCLAG*PLAAHLP\AEFQPLAVARLP 

WGQISWGFC 

1383 

6880 

C 

1486 

30 

200 

MC1SYTKGHFVVVWWVFPGFSK1LF 

RYISCPAPCRSSRMQSQCAHSSQSE 

VPGHRA* 

1384 

6881 

A 

1487 

31 

664 

APALPGCEHMMAIRELKVCLLGDT 

GVGKSSIVWRFVQDHFDHNISPTIG\ 

ASFMTKTVPCGNELHKFLIWDTAG 

QERFHSLAPMYYRGSAAAVIVYDF 

TEAGFHFHPLKKWV\KRLKELGPE\ 

NIVMAIAGNKCDLSDMEVPPEGML 

KEY\AESIGAIVVETSAKNAY*YRKS 

SFKEISRPDPHPWTPHENGNNGTIKS 

*EANPCKPVRRCC

1385 

6882 

A 

1488 

124 

1180 

DLGKPLFKVQEEGGRPPTLNQSEW 

ALCPQGPGANHDARSLIMDSPRAG 

THQGPLDAKTEVGADRCTSTAYQE 

QRPQVEQDGKQAPLSPGLPAMGGP 

GPGPCKDPAGCGGAGANGGSEPLVT 

VTVQCAFTVALRARRGADLSSLRA 









LLGQAFPHQ\AQLGQLSYLAPGEDG 

HWVPIPEEESLQRAWQDAAACPRG 

LQLQCRGAGGRPVLYQWAPPPPK 

RGTPKPKRALDPFPPPSPPPPPKGPE 

DLGFRQGDTVDVLCEWGWAWPFP 

GSTWPGVWGA*SSACAVSRRGTQ 

KATPG*RPAEPDVPLAVDQAWLEG 

HCDGRIGIFPKCFVVPAGPRMSGAP 

GRLPRSQQGDQP 

1386 

6883 

A 

1489 

1 

229 


1387 

6884 

A 

1490 

3 

461 

AASTRFRASGQFVMAGAGSAAVSG 

AGTPV\AGPTGRDLF\AEGLLEFLRP 

AVQQLDSHGHAVRESQVELRDQID 

NLATELCR1N*GQKVAL\DLDPYVK 

KA/ILNARRRV\VLVNNILQ\NAQE\R 

LRRVNHRWAREQPARRAMLDSGIY 

PPGSPGK 

1388 

6885 

A 

1491 

1274 

1416 

FGIFSQFSVLH*SGK*A*N\YYYYYY 
YYYCYFYKMEYGSFFNLQVTF 

1389 

6886 

A 

1492 

243 

1125

FQQRLYRAARRFTMVKIAFNTPTA 

VQKEEARQDV\EALLSRTVRTQILT 

G\KELRVCHPGKKEGSSGEMLWFTL 

FRAFQFILG\GLYLFGGACIYK/YTF 

MPKRHHFTVGEMCFFDSEDPANFPF

GGGEP*LSCLVT*/EEADIREDDNIAI 

IDVPVPSFS\DSDPAANYFMTFEKG 

MTA\YL\DLLLG\NC\YLMPLQYFYL 

LWPPKKIWVELFGQTGRVGRY\LP\ 

QTYVVR\EDLVAVSRKIRDVSNLGIF 

IYQLCNXNRKSFRLRRRDLLLGFNK 

RAIDKCWKIRHFPN\EF1VETKICQE 

1390 

6887 

B 

1493 

28 

282 

MYHDWRLVPKHEEEAFTAFTPAPE 
DSLASVPYPPLLRAMIIAERQKNGD 
TSTEEPMLNVQRIRMEPWDYPAKQ 
EDKGRAKGTPV* 

1391 

6888 

A 

1494 

2 

187 

QNDRKMETQA/PEEPMLNVQRIRM 
EPWDYPAKQEDKGRAKGTPHPTPR 
AHAGESGKRSLPFPH 

1392 

6889 

A 

1495 

302 

771 

RKRGVCTHLLCRRRSASNCRAPALP 

SLTFEGQDAPGLPVVQVLRRWGP 

HPREAPVARLWLPQPGLARDQAA 

QAEEREAAEQLQ/AGGSRGARRPRP 

GPAAGVQRTAASRSRSPPRRAGSRA 

ACSVAPAGRARGGPAPRSAADAPS 

SAPWRVRVLSG 

1393 

6890 

A 

1496 

497 

1212 

SWPGGEAGTARRPGCLPAPA*TR*R 

PPPRAWAPAASHGT*RVRSPAPRSQ 

SSLMKKKRR\FEGQDAPGLPWQV 

VQSHKQAGSARKALGPRGQEVGAS 

/DHANLQGGGEAGRPAGCRVGVRT 

GCTHLLRVVGPHPREAPVARLVVL 

PQPGLARDQAAQAEEREASEQLQQ 

SGSRGARRPRPGPAAGVQRTAASRS 

RSPPRRSGSRAACSVAPAGRARGGP 

APRSAADAPSSAPWRVRVLS 

1394 

6891 

A 

1497 

852 

1562 

FGKAGWELNRRERGSWRVEEKDL 
QRWGVCGGGLCLPKPVRRSGVCLS









LKHISGGLRTLSQAPNWRNNSNGR 

VRGKHSLNLAQPFHPSLSYELKPCC 

VSQGLQRGILPPPQVSGPTFCSPKAL 

TPPSVRVLPPPPPP*CASVCMSVNSP 

PALPTPLAKVSPADLAPRD*AHISG* 

RG*PLGHPHLSPLMLFTSPEPSE/PPP 

YPPSAPVPSSFSPARPAPPTVWPPPM 

QHRLWLPFPSS 

1395 

6892 

C 

1498 

127 

355 

MKNRILQRNGXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXX*

1396 

6893 

A 

1499 

529 

2082 

FLDLLHFTTSLIIPHYKRIRDLTYTIR 

HFPFVHLSPKCLDRLPSLFLIHLLGV/ 

VIL*ALEPIH*SSSFLFH*LKIFSSPLW 

NI*VSPTFDSHF/CFLLILNLFLLSFPL 

SALI/NFQLQF*IKNETFHTLLIG*SEH 

*I/HT*MISTLFWLFK/LLLTGSLV*A 

RIFSCVHTQYHHACAIEKEMYLSSL 

FLFFMNNSKSCLILFCSIIGPEFFDILL

IC/LFFFLD*IIFSSIPALQ*YYLWALN 

AHSSQKARDSMLS/F*/ILVCWIAVT 

LG*DTLLAVTLE*DILSKVHCFMDPI 

SSSFLVFNPHFGWENFPQRYFPQVR 

VQRRLS*KFA*QEKCPPLWMFLPEQ 

SALTMKFLGYQKISLKALKDVDPID 

FWAFSVG**QIC*QSISHPFLC*ANR 

DDLICFLPGHFNIFLLSLIFQNFTILC 

LDVRLYWALSIWRLSL/RLFFN/IFFF 

LFLHLTLSLASLPGTPNITY*MTMSL 

FFLYNFYYLSLFALYLGEIPSTLFFQT 

VYQ1LLAVLFYFQRDLPYSQSSLSFP 

1397 

6894 

A 

1500 

3 

930 

SSRGRAGGVWRFERDEDGTGAGCG 

QWTRFCREPKMAVNVYSTSVTSDN 

LSRHDMLA\WINESLQLN\LTKIEQL 

CSGAAYCQFMDMVLFPGSIALKKVK 

FQAKLEH\EYIQNFKILQAGFK\RMG 

VDKIIPVDKLLKGK\FQDNFEF\VQW 

VKKFFDANYDGKDYDPVA\AR\QG 

QETAVAPSLVAPALNKPKVKPLTSSS 

AAPQR\PI\STQRTA\AAPKAG/PLGV 

VRKNPGVGVNGDDEAAELMQQGQR 

I*NLLFEDLGGKERDFYFGKLRNIEL 

ICQENEGENDPVLQRIVDILYATDE 

GFVILDEGGPQEEQEEY 

1398 

6895 

A 

1501 

3 

87 


1399 

6896 

A 

1502 

1 

667 

RRSSARRGGRSEPGRAAGGGAAED 

TRRRAGDMDRGEQGLLRTDPVPEE 

GEDVAATISATETLSEEEQEELRREL 

\AKVE\EEIQTLSASVKQQKRKHL\A 

EIQAGNLGINSLQELKQNIAQGW*D 

VTVT\SAYKKTVSETLS\QAGQKA\SA 

AFSSVGSVITKKLEDVKNFPNFLNH 

FEEKVENLKSKVRGH/TKPAGGDFV 

GEV\LNSAANASATTTEPLPEKTQES 

L 

1400 

6897 

A 

1503 

1 

395 

AKAKMADVLDLHEAGGEDFAMDE 
DGDESIHKLKEKAKKRKGRGFG/SR 









*VWGKWGRVA**GRSYGFWSPHPP 
HRNNGRKEEFSVISCMSLIEEGSRA 
RMREDYDSVEHDGDEPGPQRSVEG 
WILFVTGVHEEA 

1401 

6898

A 

1504 

146 

833 

CLSGTDEISIEGEMADVLDLHEAGG 

EDFAMDEDGDESIHKLKEKAKKRK 

GRGFGSEEGSRARMREDYDSVEQD 

GDEPGPQRSVEGWILFVTGSP*RKP 

PEEDIHDKFARIMGEIKNISSSTFDR 

RTGMLKGYTLSLNIETYKEUQAA 

MEGLNG\QDLMGQLGGV\DWCFVR 

GPPK\GKRRGGRRRSR\SPDRRPSLT 

GPLLSRCSLQDSIWTMAALGQIGLG 

WELCCVYI 

1402 

6899 

A 

1505 

717 

1563 

APLPAVLTQTIHLVTGTAFHSGKVD 

IVTIGYPFIDLNDMVCMSQYDSTHG 

*FHSTVKTENGKLVINGNCITHHPR 

RDPTKIK*DDAGTEYVVESTGVFMT 

MEKAEAHSAPSADG\LNDEKYENSL 

KIIGNASCATKGFAP/LPAKVIHDNF 

GIVEGLMAMVHAITATQKTVDGPS 

KK\WHDSHGALQNIIPASTGATK/A 

GMAFLVSTTNVLVMDLTLEGILGY 

TEHQVVASDFNSITHSSTFKAGVGI 

ALNNHFVKLISWYENEFGYSNRVV 

DLMVHMASKGSS 

1403 

6900 

A 

1506 

625 

2919 


1404 

6901 

A 

1507 

2 

76 

HHYAKLGTRAVRRARRCAGWQSY 

VDNLMCDGCCQEAAIAGYCDAKY 

VWAATAGGV\FQSITPIE1DMIVGKD 

\RKGFF\TSGLTLGA\KKCSVIRDSLY 

V\DGDCTMDIRDKQS/QGGEPT\YNV 

A\VGRSG\RALVI\VMGKGKVFHRR 

HTLTRKAYETPLYT*RQAWHEGSA 

KGSKMCRLAELRG 

1405 

6902 

A 

1509 

63 

290 

GGILLSISRPYKTKPTHGIGKYKHLI 
KAEEPXKKKKGKVAVRAINLGTDY 
NYGVLNIH\LT\AYDMTLAESYAQY 
VHN 

1406 

6903 

A 

1510 

315 

1092 

RPRSSKRMSGTSEKVLCLRNNTIFK 
QAFSLLRFRTSGEKPIYSVGGILLSIS 
RPYKSK\PTHGIGKYKHLIKAEEPKK 
KKGKVE\VRAINLGTDYEYGVLNI\
HLTAYDMTLAESYAPLFSTTFCNSL 
SH*KSEESYAMPTQNHKKWLPVCR 
DQGQPKCLLGLKCLPTHE\RV\VQIS 
GLSATFARKFSWKIIPKPVLP*RESG 
LFS*REHTERKTSRGRFQRFDPELGR 
TFWAKFEVATVDPFHCQQWSYLSA 
KEKSLLGS 

1407 

6904 

A 

1511 

284 

758 

KQNPSSPLQRLIAGSNLDSEPRIQTD 

ILKQATKDRVSDFHKLKQSRFFDEN 

ESPVDPQHGSKLADYNGDDGNVGE 

YEADKQAELAYNEEEDGDGGEEDV 

PPDEEREL/PNGKKQAMESNISMMS 

FKS*RNASENLKCCKMKSFYFVLSD 

FCCKDELYQL 

1408 

6905 

A 

1512 

148 

476 




1409 

6906 

A 

1513 

204 

620 

GDAMAAVTPRPPLPEGCRAPSSAPT 

VSLPELRSLLASGRARLFDVRSREE 

AAAGTIPGALNIPVSELESALQMNDQ 

LAFQGFIFLLEKPKLGR*ASSFSFLFR 

WGKRGLQATQLARSLGYTGGFATY 

AGAY*\EWLEKES 

1410 

6907 

A 

1514 

1386 

1711 

FPKSIMGLVTIQDMNLCIKFIGLSHL

PALVLLYC/DVRACVMSGC/LRKTS 

EKNLKSRPSFFACSILSRNVQCNWT 

KIGMFKDI*GFFCFPYFYLFSVSCIFA 

YLCRFSGF 

1411 

6908 

A 

1515 

481 

1380 

TSKQNAAPLVKYFQEKGLIMTFDA 
DRDEDEVFYDISMAVDNKLFPNKE 
AAAGSSDLDPSMILDTGE1IDTGSDY 
EDQGDDQLNVFGEDTMGGFMEDL
RKCKIIFIIGGPGSGKGTQCEKLVEK
YGFTHLSTGELLREELAS*SERSKLI
KDIMERGDLVPSGIVLELLKEAMVG
\SLGDTRGFLID\GYPRE\VKQGEEF\ 
GRRIWRPHSWVICME\CSADT\MTN 
RL\LQRSRSSLPVDDTTK\TMAKRLE 
AYYRNASIPVIAYYETKTQLHKINAE 
GTPEDVFLQLCTS*LTLLFSEGKNA 
CLG 

1412 

6909 

A 

1516 

47 

416 

NSYIYMCIYSYINTIYIHIYLESNISLP 
LNIY1STPT/HIY*RHTV* VHTKAYV 
HML*HVYIHFCLCVHKSFKGTIYRD 
ASFLESCSKVNTECHKLRKVKRKYS 
RIHHTGIHQSSLIITSPFTF 

1413 

6910 

A 

1520 

1386 

1666 

SLMAPQKMGRTSCSPSERLGN*GPE 
TGSDSHKTPQQGCKGGRTRGSIVSL 
GDRRPLAP/GACFAGDKDFLGLRSP 
GVGTALLGCTSINQRLLWA 

1414 

6911 

A 

1521 

304 

1253 

VTNEMSQGVGKYDFYIGLGLAMSS 
SIFIGGSFILKKKGLLRLARKGSMRA 
LGQGGHAYLKEWLWWAGLLSMGA
GEVANFAXAYAFAPANSSGLHLGAL 
\SVLVKCPFFLSYFLNERLNLHGKIG 
CLLSILGSMVMVIHAPKEEEIETLNE 
MSHKLGDPGFWFATLVVIVALILIF 
VVGPRHGQTNILVYITICSVIGAFSV 
SCVKGLGIAIKELFVEGSLCCGIPWA 
WILLLSLIVCVSTQINYLNRAIADIFN 
TSIVTPIYYVFFTTSVLTCSAILFKGV 
GKDMPVGRCPLVL*SGFFTIIVGGY
SCCMPLKTSALA 

1415 

6912 

A 

1522 

20 

131 

KEILPKQAFAVAPTYITEPVEIKFFFF 
SRIRL*VPPG 

1416 

6913 

C 

1523 

7 

348 

MSKLYIMQFXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX

XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXHY 
TV* 

1417 

6914 

A 

1524 

1103 

1388 

VLLIFNLFPMALYFVCFSF/CFFETEA 
HSVT*ARVQWHDLGSLQPLPP*FK* 
FSCFSLPCS*YYRHLPPYPANFCIFSR 
DGVSPCCPGWS*TPNLR 



1418 

6915 

A 

1525 

149 

421 


1419 

6916 

A 

1526 

1 

1107 


1420 

6917 

A 

1527 

25 

1486 

GPQQPHSRSTHASGRPQSLSPVLSLS 

PDSMSFTTRSTFSTNYRSLGSVQAPS 

YGARPVSSAASVYAGAGGSGSRISA 

SRSTSFRGGMGSGGLATGV\AGG\L 

AGMGGIQNEKETMQSLNDRLASYL 

DRVRSLETENRRL\ESKIRE\HLEKK 

GPQ\VRDWSHYFKI\IEDL\RAQIF\A 

NTCGTMPRIRSARIDNA\RLA\ADDF\ 

RVKYEDRSWPMCPVLWRTDIHGLP 

KVH/IDDTNYHTDLQLETEAEALK\E 

ELLFQ*RRNHEEGS*KALRRQASSS 

GMNAWRLD\APKSQDLAK\IMADIR 

\AQ\YDELGSKKNPRGSLDKY\WSQ 

QI\EESTTGGSPQKSAEVG\AVETHA 

HRSLKRTVPVLGRSTLDSMRNLKG

QLWRTSLREG*RPAYALTRLEPAPT 

GSLL\HLESEL\AQTRARGTAARPRE 

YEALLN\IK\VKLEAEIA'RYRRL\LED 

GEDFNLGDALDSSNSMQTIQKTTTR 

RIVDGKVVSETNDTKVLRH 

1421 

6918 

A 

1530 

277 

693 

PWHCPDSHYSQQPGSTASSSVPART 

GGPCWSSSCSPN*CCTSCCSTTPTPT 

LDPR/GSRHCQLPWLKQLPKGM/CT 

STCLHGSCGCICGSCGPCGCIDGPTL 

GRNHNEPHHHSHGDLPYRFPEHAH 

HGHATLMGLMDTPLV 

1422 

6919 

A 

1531 

1 

364 

PFVSLGLMCFGALIGLCACICRSLYP 

TIATGILHLLADTML*SPGHMEVSC 

VDAPAEIPS*APN*QPTSFPAMCGSH 

PGCPALTSLQRTSATAIILLHPCETLI 

KNQLAEPNQPMELIEIK 

1423 

6920 

A 

1532 

1 

898 

RGESRVLWSELEGEAGGAGGWASS 

LNARMDNRFATAFVIACVLSLISTIY 

MEGNSIGTDFWYEYRSPGQENSSDL 

NKSIWDEFISDEADDATYN\DALF\R 

YNGTVGFLRRVYSPYPKTLHWVLA 

HHERTESFDVVTK\CVSFTL\TEQFM 

EKF\VDPGK/HTNSRGLDLLRTYL\W 

RCQFLLPFWSLGFDVLWGALIGLC 

A\CICRSLYPTIATGILHLLAGLCTLG 

SVSCYVAGIELLHQKLELPDNVSGE 

FG\WSFCLACVSAP*QF/LWASALFI 

WAAHTNRKE\YTLMK\AYRVGMSK 

KPACF 

1424 

6921 

A 

1533 

939 

1591 

LQSLCRVLLQMESGSRD71PGVCKR 

ERENREDTEVLQPRFPYQGGQLVG 

KAATPQPFG*STVGWTLQPQ*P*YP 

AGQGCPP\RCPPAPSS*GSKRPPEPQ 

QGRLGPKPGSPQA*GNASPPKCPPA 

PVPTPC\PPTLCTGEKTGGARAGPW 

VGAGSPW*DPR*DGPIPCPVGDPLSP 

HPCCV1VALLPFNVSVPGGRGGAPP 

QPP*T*PKAMAVAPPFV 

1425 

6922 

A 

1534 

34 

912 

GRIRMQRQSTTGGRGIMEGPRGWL 

VLSVLAISLASMVTEDLCRAPDGKK 

GEAGRPGRRGRPGLKGEQGEPGAP 









GIRTGIQGLAKGDQGEPGPSGNPGK 

VGYPGPSGPLGARGIPGIKGTKGSP 

GNIK\DQPRPAFSAIR\RNPPMGG\NV 

VI\FDTVI\TNQEEPVQDPSG\RLVCP 

EPGYYYFT\FQGAGPQW\EICLSIVSL 

LQGARVRRSPGAFC*HPPTRGLFQV 

VSGGMGL\QLQQGDQ\VWVEKKPP 

QKGSHFYQGSE\ADS\VFTGFL\IFPIC 

LSQGRTPSPPPTSLASMLRL 

1426 

6923 

A 

1535 

919 

1260 

YSVSEFRGQTLTAKFCFFERESHSW 

PRLECRGTILAHCNLCLPGSSDSPAS 

ASRVAG\TTGACHNARL1FVFLVET 

GFHHVVQAGLNS*PQVIHPPCPPKV 

LGLQACTWHLAH 

1427 

6924 

C 

1536 

130 

441 

MVQNKGQLPANSQRGTEAXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXX

XXXXXXXXXXXXXD* 

1428 

6925 

C 

1537 

5 

316 

MVQNKGQLPANSQRGTEAXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXD* 

1429 

6926 

C 

1538 

74 

91 

MTLAM* 

1430 

6927 

A 

1539 

368 

430 


1431 

6928 

C 

1540 

191

592 

MGQQPGQARAPSYCRCPLSPGSGR 

ALRWERPGGGQGPKEIVLSGCVPE 

KGPQTPAQPHSLRHLQNPEATARTG 

EEATSAAGGPWASPSFGGTQLCSDT 

MPALLGARSTCWIATHTHVCTLPLS 

ECGPINILLE* 

1432 

6929 

A 

1541 

58 

1531 

VIAVTSALPGRTQAAWTRVVKMDL 

LAAKMAVGGGSLMTDLTSSISKPL 

VPVGNKPLIWYPLNLLERVGFEEVI 

VVTTRDVQKALCAEFKMKMKPDIV 

CIS\DDADMGTADSLRYIYPKLKTD 

VLVLSCDLITDVALHEWDLFRAYD 

ASLAMLMRKGQDSIEPVPGQKGKK 

KAVEQRDFIGVDSTGKRLLFMANE 

ADLDEELVIKGSILQKHPRIRFHTGL 

VDAHLYCLKKYIVDFLMENGVSITSI 

RSELMPYLV/RGKQFSSASSQQGTRK 

EKEGGSKGKRGLKSFRISYSFY*KE 

ANYTGTGAPY\D\AC*NACRGDRWE 

DLSRSQVRCYVHIMKEGLCSRVSTL 

GLYMEANRQWPKFLSALCPGRTHQ 

SISSSPDCQHYTWVGVD\SLIGPRDH 

RLGEKVIPLSASVIGSSL/CLIKDRVT 

IT\NCL\LMNSVTFWRKEANIQGSVI 

L\NNAVI\EKGADIK\DCLIGKWARRI 

EA\KAKRSVCR*S*GNDQLMEI 

1433 

6930 

A 

1542 

315 

644 

EEETPKDQCLGAHFSSDTFPRQSRL 
LGVHGRGAGSLCTGLLSEHCPSTPP 
SGSPRVTYYLPH\PPLPPSP/SWAPTP 
PAHR*KPSTEPFPGPSTQVV*PLPMF
TTKGPAPP 




1434 


6931 


1545 


1 


364 


FFFFF*DGVSLLWPRLERNGVISAHC 
NLCLPASNDSPASASLVAGITSAC/R 
PLPPKQK*LFFFLRWSLALSPGWSA 
VARSWLNATSPGYF**KQGFHHAG
QDGLGLLIHPPWPPEVLGLQA 


1435 


6932 


1546 


391 


1436 


6933 


1547 


2371 


GPPGRARARGLRRAPAAFLRRSLSL 

PAAFSSAAGPSSPQRSQEGRREPRTP 

RSSRGSRGSRPRRRLRLRRGRSAIH 

GEGRTAKGSAAWSARTFRSPPGVG 

RDPMRRAHEGREIPSLGGARRREVL 

QAGRSQRAAGRRRRRQELELGVGS 

GRPGGPPPGPGRRGTCAAALPPEWP 

RRRTGLPRRGPRPPLAMAKWLNKY 

FSLGNSKTKSPPQP\PRPDYREQRRR 

GERPSQPPQAVP\QASSA\ASASCGP 

ATASCFSASSGSLPDDSGSTSDLIRA 

YRAQKERHFQDPYNGPGSSLRKLR 

AMCRLDYCGGSGEPGGVQRAFSAS 

SASGAAGGCCASSGAGAAASSSSSS 

GSPHLYRSSSERRPATPAEVRYISPK 

HRLIKVESAAGGGAGDPLGGACAG 

GRTWSPTACGG*KLLNKCSSSSAEE 

SGAGMKDKVTIADDYLDPFDAKND 

LKSISGKGESAGYMEPYEEQRIMTE 

Y*RQECVRSQH*GIQLYETP*EPEGQ 

NFESESESTVNPRMRENKLPQDYEQ 

/RPAD*Y\DQPLELNPV/TQFPALAA 

QFNGNEKPQSSPSPSR\DRRRQL\RA 

PGGGF\KPIKHGSPEFCGILGERVDP 

AVPL\EKQIWYHGAISRGDAEN/LCL 

RLCKECSYLVRNSQTSKHDYPLSLR 

SNQGFMHMKLAKTKEKYVLGQKS 

PP\FDSVPEVIHYYTT\RKLP\IKGAE 

HLSLLYPVAVRTLLSGPDSALLCDR 

AWRLARCQRPPTNQPPATVAGCW 

CVVCMVLAHHCMSLECCCHLRGLE 

KAW1KTEGRQHTTPSPNPNEALEFL 


1437 


6934 


1548 


304 


678 


PQV1LPPLVSQGCWELPDVSPLRPSL 

VW*FL/RK*KLDLPCDPAIPLLGVYP 

RKIKACFHTKTCIQIFIATLFGIAKKK 

GKQPKQPSAGEWINTWW/HIHTMK 

HCSAVKKEQT1ET1SLFRSRIWRI 


1438 


6935 


1549 


80 


623 


LGGVTRGFNMRIEKCYFCSGP1YPG 

HGMMFVRNDCKVFRFCKSKCHKN 

FKKKRNPNRKVRWTKAFRKAAGKE 

LTVDNSFEFEKRRK*TYSNYQRDLL 

G1KT1\DAMKRVEEIKQKPPS*IYNEQ 

IE/GKIKSYRKFQDIKRSPSQNIPSLS 

RAPPCQAKGKQLGEEMGTARLQGG 

CGHGKMPP 


1439 


6936 


1550 


1021 


1544 


EPTKKCCVYYAQFLSLPSLFFPTGSE 

EQDS1YFILFFEMEFRSWTQAGVQW 

CDLGSLQPPPPGFKRFSCLSLPSSWD 

NRLLPQGPANFCIFSR/GWGFHRVG 

QAGLFSRDGVSPS*SGWSRTPDLV1 

RPPWPPRVLECSGQYIIFPWLFSRFS 

LS1FSKFDCNLSQFGFIWICLFYSS 



1440 

6937 

A 

1551 

2 

210 


1441 

6938 

A 

1552 

2 

310 

IGVKMEEYAREPCPWRIVDDCGGA 

FTMGTI\GGGIFQAIKGFRNSPVGVN 

HRLRGSLTAIKTRAPQLGGSFAVW 

GGLFSMIDCSMVQVRGKEDPVANSI 

TSGCLNGEPYWQARNGTQWPMVG 

VSPQWVGIFPSF*FEGSWYLCLTRF 

ASGTVFPMGP\QFA\EDPSPVCLSTQ 

LPSLVPFGGGIFQAIKGFRNSPVGVN 

HRLRGSLTAIKTRAPQLGGSFAVW 

GGLFSMIDCSMVQVRGKEDPWELH

HKWVP 

1442 

6939 

A 

1553 

1 

4629 


1443 

6940 

A 

1554 

96 

721 

PGQLSSLTPPRPASLLPWRAAYLFLN 

LFLPAGLLAQGQYDLDPLPPFPDHG 

QYTHYMDQIDNPDYYDYQEGTPR\ 

PSEGQFQ\F\QSQQEVQQGVIPSPNPR 

AQGNAEL\EPTEPGPLDCREEQYPC\ 

TRL\YSIHRPCK\QCLNEVCFYSLRR 

VYVINKEICV\RTVC\AHE\ELLRA\D 

LCSGTSFSKCGR*WASSGL\CQSV\A 

ASCAXRSCGSF 

1444 

6941 

A 

1555 

262 

732 

FQNKGNFFSTKRTEVSPSTQFNYFA 
RKNTTLIIRISHSSLGQVRIRLVWFG 
LVWFWFLETGV/CTLVIRGWEFQW\ 
CDQNSLQP*TPGLQRIFPTSASQSTG 
ITGVSHHSLVRYVFVTEIQLKFWILI 
TKITVLLVYN*L*NKGY*YIFIFTFFL 
NLQN 

1445 

6942 

A 

1556 

162 

496 

HSYIHIVHVCNFFMYSFAVFVFKKH 

LLLCLYNRTVI1YYNLGKL*INS*FK 

QPVYMHI*VLYC/INLCFTYMVKAA 

RILLLICNYTHKIYICMIHEIYLEMFII 

LMDILWCE 

1446 

6943 

A 

1557 

2 

247 

GEIVVFKVEGRDIPIVHRVIKVHEKD 
NGDIKFLTKGDNNEVDDRGLYKEG 
QNWLEKKDVVGRAR/GYALLAVM 
GAYVLLKRES 

1447 

6944 

A 

1558 

1 

503 

VRAGAVGAHLPASGLDIFGDLKKM 

NKRQLYYQVLTLAMIVSSALMIWK 

GLIVLTGSESPIVVVLSGSMEPAFHR

GDLLFLTNFREDPIRA/GDNGDIKFL 

TKGDNNEVDDRGLSKEGQNWLEK 

KDWVGRARGFLPYVGMVTIIMND 

YPKFKYALLAVMGAYVLLKRES 

1448 

6945 

A 

1559 

180 

257 


1449 

6946 

A 

1560 

2 

676 

FVRCSAAVCATQSRRAARSPENPA 

MVRAGPPWGLNLPASRLGISSAD\L 

KKMNKRQLYYQSFKPSPWIVSSAL 

M1WKGLIVLTGSESPIWVLSGSME 

PAFHRGDL\LFLTNF\REDP\IRAGEIV 

VFKV*RPRTFPISSQR*SKVHEKD\N

GDIKFLDLKGDNN*SLMD*EALYK 

KARNWLEKKDVVGRARGFLPYVG 

MVT\IKMNDYPKFKYALLAVMGA\ 

YVFLKR 

1450 

6947 

C 

1561 

449 

820 

MVIXGQISPMTATSGQKAFLAGPLG 


SLGTLPRSQAVKRRLLLKLTKDRIF 
RDVLLLMFLKGHRETHGQCLQRQQ 
TKESTSIVVLALCGLYGKQVVTLLN 
VGIPFSINVYFTTPKSPLIMKLII*

1451 

6948 

A 
n 



All 


1452 

6949 

A 

1563 

22 

212 


1453 

6950 

A 

1564 

160 

397 


1454 

6951 

A 

1565 

17 

262 

LFKWAKALNRHFSKKHIQMANQH 
MKRCSTSSAIRE1QVKIAMVYN*YTI 
*HSQ\YSLQLPFPLWKTVWKFLTW 
KLLNCSVK

1455 

6952 

A 

1566 

2 

294 

GNKMAAPKGSLWVRTQLGLPPLLL
LTMALAGGSGYTASAEAFDSVLGDT
ASCHRACQLTYPLHTYPKVGPVRS
GLRPFPCSPFLGSPHVCRLWQPGC

1456 

6953 

A 

1567 

366 

1412 

QRGTRWRRERGSLWVRTQLGLPPL 

LLLTMALAGGSGTASAEAFDSVLG 

DTASCHRACQLTYP\LHTYPKEEEL 

YACQRGCRLFSICQFVDD\GIDLTRT 

KLECESACTE\AYSQSDEQYALPFL 

GCQNSACHFAENLRQEQLYVPRWP 

KMAPTFFL*LLGEGSFWELT*WDSA 

Q\SFITSSWTFYLQA\DDGKIVIFPV* 

SQKSQYAPHFGAREPTNFEENHLLS

KMSSDLQMGKFHQAHQGIFLKNEE 

RDGLFKKPSILNSGWILTTVTLVLSYV 

MVLLWUCCATVATAVEQYVPSGE 

AGVTMGDLEFMNEQKLN\RYPASF

SCGLVRSKTE\DHEEAGPSYLPKVN

LAPFLEI

1457 

6954 

B 

1568 

76 

384 

MSGWGVLSGRLNPAAREKDVERFF
KGYGRIRDIDLKRGFGFVEFEDPRD
ADDAVYELDGKELCSERVTIEHAR 
ARSRGGRGRGRYSDRFSSRRPRND 
RRNAPP* 

1458 

6955 

A 

1569 

3 

229 


1459 

6956 

A 

1570 

152 

536 

PDIMSGCRVFIGRLNPAAREKDVER 

FFKGYGRIRDIDLKRGFGFVEFEDP 

K\DADDAVYELDGKEL\CSERVTIEH 

ARARSRGGRGRGRYSDRFSSRRPR 

NDRRNAPPVRTENRLIVENLSSRVS 

WQVC

1460 

6957 

A 

1571 

771 

1383 

ILIEYKCGKCHVCTLSNIFSFSSLVFF 

ISCDCLCVPFPLLCLTQLSCVKDLK 

DFMRPAGE\VTFA\DAHRPK\LNEGV 

VEFASYGDLKNAIEKL\SEKEINGRK 

IKLIEGSKRHSRSRSRSRSRTRSSSRS 

RSRSRSRSS\KSYSRSRSRS\RSRSRS 

DVPVLLSRSPRA*EEPRNRGSSSRSK 

SPASVDRQRSRSRSRSRSVDSGN 

1461 

6958 

A 

1572 

236 

1377 

PDIMSGCRVFIGRLNPAAREKDVER

FFKGYGRIRDIDLKRGFGFVEFEDPR 

DADDAVYELDGKELCSERVTIEHA 

RARSRGGRGRGRYNSDRFSSRRPRN 

L>K/K VUcLi WMAALNNY W*G*PFKI 

QESLAVMILGPAV*SVLLFPR*PIVL 

DESI*VIEHKSIDGSH*NGL*YLMA*


TCPQLNTSAVIAFLPL*IRIFFLRNAP 
PVRTENRLIVENLSSRVSWQDLKDF 
MRQAGEVTFADAHRPKLNEGVVEF 
ASYGDLKNAIEKLSGKEINGRKIKLI 
EGSKRHRSRSRSRSRTRSSSRSRSRS 
RSRSRKSYSRSRSRSRSRSRSKSRSV 
SRSSPCPEKS\QKRGSS\SR\SKSPSHL 
WNRPEVPGPRSRSQIQLDQWPIKPV 
K 

1462 

6959 

A 

1573 

568 

770 

PDIMSGCRVFIGRLNPAAREKDVER 

FFKGYGR1RDIDLKRGFGFVEFEDPR 

DADDAVYELDGKELCS\ERVTIEHA 

RARFTRLGRGRGRYSDRFNNSAEL 

RNDRRNAPPVRPVENRLIVENLSSRV 

SWQDLKDFNMRQAGEWTLPDTTRL 

NL\NEGVVEFASMGDLRNAIEKLSG 

RELNGRKIKLIERPAKRPQ*VQQSRS 

SDPGTQKSPLGPRSRSPSPVVANLN 

SRSKK/RRGSREPGSPEPSPRSC*VGS 

SPVP*ERFFKGYGRIRDIDLKRGFGF 

VEFEDPRDADDAVYELDGKELCS 

1463 

6960 

A 

1574 

22 

202 

TKSSS*CDSVATCG1IKFSCPHNLSKI 
HDSISPRVaSICKPHGSIHKLCIKIKF 
HIFAR 

1464 

6961 

A 

1575 

1 

1878 

MQYSHHCEHLLERLNKQREAGFLC 

DCTIVIGEFQFKAHRNVLASFSEYFG 

AIYRSTSENNVFLDQSQVKADGFQK 

LLEFIYTGTLNLDSWNVKEIHQAAD 

YLKVEEVVTKCKIKMEDFAFIANPS 

STEISSITGNIELNQQTCLLTLRDYN 

NREKSEVSTDLIQANPKQGALAKKS 

SQTKKKKKAFNSPKTGQNKTVQYP 

SDILENASVELFLDANKLPTPVVEQ 

VAQINDNSELELTAVVENTFPAQDI 

VHTVTVKRKRGKSQPNCALKEHSM 

SNIASVKSPYEAENSGEELDQRYSK 

AKPMCNTCGKVFSEASSLRRHMRI 

HKGVKPYVCHLCGKAFXTQCNQLE 

NACKELHTGEKPYKCGICVIKGFAQ 

KC\QLVFHSRMHHGEEKPYKCDVC 

NLQFATSSNLKIHARKHSGEKPYVC 

DRCG\QRFAQASTLTY\HVRR\HYW 

EEKPYV\CDTCG\KAFAVLLVLFIHS 

FLRK\HTGEKPYICGICGKSFISSGEL 

NKHRRSHTGERPFICELCGNSYADI 

KNLKKHKTKV\HS\GADKTPRTPSA\ 

EDPNLGVKQDPUQKSPFNPETYGCE 

SPS*YELYPLALPLGTED\HHMLLPV 

TDTQSPTSDTLLRSTVNGYSEPQLIF 

LQQLY 

1465 

6962 

A 

1576 

42 

134 


1466 

6963 

A 

1577 

154 

768 

HVACGLLWIYVSPSAHLNLDGT1TT 

K/ENLGTVNEILLGSNPTEAELQDMI

NEVMSDGNGTIDFPVEFLTMMARK 

MK\DTDSE/EKEIRRKHSRVFGLRVG 

NGLYLVACRNFRHV\M\TNLGRRKF 

NQIEEV*WN*SRGSQILDG*WSKLT 

YEEFVQMMTAKVRPLSRNVLNFLV 


QNCFICLFLWFVTLSCKKVFSPTCQ 
KKNMHVIGN 

1467 

6964 

A 

1578 

3 

202 

RRMVSAFSCRCMPSEPCIAVTAPCM 

MAGQAPAERTAQ*PLYF/CTLFQGS 

LSPT*REVGWPLGPAGM 

1468 

6965 

A 

1579 

133 

378 


1469 

6966 

A 

1580 

891 

1744 

DMFPITPSRSVLPSFLLTYLPRQSL/N 

SVAQAGVHWCDLSSLQPPPPGFKQ

FSCLSLPSTW*LGLQACTTVPS*FFV 

VVVCIFFFFLVETGF/TLGWARLVLN 

L*LQ/CDPPTWASQSAG/ITDVSHHA 

QPILRFKCLS*CIRSFNEGRIKGTRQ 

NKVELSL/CFLRQESHSVTQAGVQW 

CNLSSLQPLPPGFKQFSHLTGVLPKC 

WDYRSKRPRPANQTSILGC*RKDPY 

*LHWAKKATEDIKLESCRLTPGKAR 

PTANFLRQGQFFWGPILGGLLGPQE 

GFPFLFFKGF 

1470 

6967 

A 

1581 

2119 

2436 

TTRYLKKTSTTGQRKKRGRGNGSF 

PTENLVPSGTVTGSQQLGPPFR*N/H 

TEECWGPPTADGRAGKGPRQQPGR 

AQRIYRWDPSDGTTWHHHRPCGSR 

GTDQPETK 

1471 

6968 

A 

1582 

208 

296 


1472 

6969 

A 

1583 

185 

947 

SHCSSGMEIPVDQLPSLPRAALVAQ 

NYINYQQGTPHRVFEVQKVKQSQA 

WKDI\PGKEGHKYS/HLKFAV*KKL 

YKKQVKG*TCTAVEVLLPFQRGQET 

LHQEVNFHILKEKLGKNPD\EEDNT 

F\YQRLKSMKEPLQAQNI\PD\NFGN 

VSQEMTLVLNLAWVACGL1**WQK 

FLLKTTWYK\MVKI\QTCQARCQRI 

DDF\IELDYPPFYFINIASQEIIP\WQM 

QVLWHPQYGTKVKHNSRLPKEVHL 

GYTPKPLTLEV 

1473 

6970 

A 

1584 

5060 

5662 

ESQAAPPPRPSTLHRPARATAITAC 

WSSQASGPQAVRRRLLTPLSSPAAR 

DLVSKEGFRRARHVVGEIRRTAQA 

AAALRRGDYRAFGRLMVESHRSLR 

*GPLGAPLLPGTGRPRPTPS1SSLQ\D 

DYEVSCPELDQLVEAALAVPGVYG 

SRMTGGGFGGCTVTLLEASAAPHA 

MRHIQEHYGGTATFYLSQAADGAK 

VLCL 

1474 

6971 

A 

1585 

2 

987 

LREGCPQR*RQPTQLDWPYF/CPFSP 
VC*KTKTKKPYPCAPKLGHVRCPT
ASTLQAHTGPCPPSPQTGPQTRAGL
LHICVGVGARFLFLLSLSPFGDIPL

1475 

6972 

A 

1586 

318 

382 


1476 

6973 

B 

1587 

106 

293 

MAGRVCLCQGSAGSGAIGPVEAAI 

RTKLEEALSPEVLELSFRGTEPPTTT 

PAGPRSAGRGAGS* 

1477 

6974 

A 

1588 

442 

905 

PMLSGRLVLGLVSMAGRVCLCQGS 
AGSGAIGPVEAAIRTKLEVEALSPEW 
LELRNESGGHAVPAWAVRLHFRVG 
CW*ALVFEGLSPLTTDTGLFHAALA 
EELGRFRSHALAMPGTDPPPQWREN 


\SQLEQLAPPCLG\GKQENFLGTPLE 
PPKRG 

1478 

6975 

A 

1589 

1206 

1335 

KTQERYFNLKQN*TGQAWWLMH/I 
WEAEAKGSLEPRSLRPAWAT 

1479 

6976 

A 

1590 

136 

837 

PSEKTSPDRDKKKRAVRSILFLLELQ 

NIIIQDHDQGQEKENESQIMKEENT 

GAGAEAKREEDMNPKVKSSKKHKS 

EEHNDKEHSSDKGRERLNSSENGE/ 

AQAQTPRKEGH/RKGRGHSRSRSRE 

RRHRIRGRERKKSRSRSRERKKSRS 

RSRERKKSRSRSRERKRRIRSRSRSR 

IQRHRA*D*KAGVGPRSRSRDRKKR 

IEKPRRFSRKFKPDSQVHLPLEGRTQ 

PWDAQEAFS 

1480 

6977 

A 

1591 

209 

1545 

PYYFLQANSPPGPLLTPALLPHRILS 

DVTQGLPHAHSACLEKLKRSYEFY 

RYF\ETQHQSVP\QCLSKTQQKSREL 

NNVHTAVR/SLWQLHLKALLNEVII 

LEDELEKLGCTKETQELVSEAYPILE 

QKLK\LIQPHVQASNNCW\EEAISQV 

DKLLRR\NTDKKG\KP\EIAC*KPTCY 

SSTF*RQPTLHIADQDPIPEEQE\LEA 

YVDDIDIDSAFQNRHDFYYLSQEDK 

ERQKREHEESKRVLQELKSVLGFK 

ASEAERQKW\KQLLFSG\HAVLK\SL 

FPVGPQWEPISNSEPSMNSDMGKVS 

KNDTEEESNKSATTDNEISRTEYLC 

ENSLEGKNKDNSSNEVFPQGAEER 

MCYQCESEDEPQSKIGSGLTTNAPPT 

P\RDSLQPSIKQRL\ARLQL\SPDFTFT 

AGPCWQEVGCLDLLLPFTTMAGNR 

LFGDEEEEQIIEEN\KN\EIEEK 

1481 

6978 

A 

1592 

1 

296 

DFPLPTLLKTGPGPGF/YNGPP*GER 
FYVASPG*IWAPQGFF*KGPPSSSSS 
SSSQRSKPLFPCFANKTG*VGCFLVI 
SQRDQIPYPRPTPPTLPWLQP 

1482 

6979 

C 

1593 

15 

350 

MLISLNINQTLLYCNKTENCXXXXX 

XXXXXXXXGGPFKRTPGGPKFNRG 

WQGKIFPLKGGLLKPHWGIFXXXX 

XXXXXXXXXXXXXXXXXXXXXX 

XXXXEKGEKQPEKPGG* 

1483 

6980 

A 

1594 

11 

129 

APWLSVLY*SESPEAQPPYMMGPLE
SP*AQRCPGLERGR 

1484 

6981 

A 

1595 

2 

660 

NFPTARLFRLFYPLFPLKIFIFPKAFN 

FCREVGPFCPPPK*GFFPKIPK*VFN 

RPP/SKGKSFTLPAPVKFGPPRGPFK 

RAPSSSSSSSSPVV*APWPTVLY*SE 

SPEAQPPYMMGPLESP*APHEGVTA 

WVES*GPCPA*PWGRQAAPQPPPPP 

QERAG*EPESKFGPGSK/PPERPVYA

GNSPVLRSGLTSPSPSPAPPGGFKY 

MEERSKADLGPGMEKG 

1485 

6982 

A 

1597 

1 

680 

ESRIRRRSSRRPREPPGPSRRRRRRR 

PDPRTMPSEKTFKQRRTFEQRVEDV 

RLI\REQHPTQIPR**LERSKGNNQPP 

\VLDK\TKF\LVPDHVNMSEL\IRI\IR 

RRLQ\LNANQA\FFLLV\NGHS\MVS 

VSPHPI\SEVYE\SEKDE\DGF\LYMV 


LCPPPE/YGLR*TLSV*N*KKKNAAS 
S*NCLNPLPKEKIKECYPTEIDQFIPI 
TDHETSSVPTLGVLGSCVCVSSRKT 

1486 

6983 

A 

1598 

2 

508 

PDSSGPHRLRENPPMVAVSCPTKTN 

VKGPPGGKVGAHAG\EYG\SEALE\R 

MFLSFPT\TKTYFP\HFDL\SHGLCPR 

LKGHG\KKVADALTANAVAHVD\D 

MPVNGVVRP*SDLH\AHKL\RVDPVN 

FKLLSH\CLLVTLA\AHLP\AEF\TPA 

VPRPPWDKFPWLSVKHRCLTFKYR 

1487 

6984 

A 

1599 

295 

758 

VLSRKCQRSLTAFSSKCPNSWFSITQ 

TECKTMTCGMPQHVTQQ*RPIINTS 

HQYSVKLGHP\DTL\NQGEFKELVR\ 

KDLQNFLKKENKNEKVIENHIMRGP 

GTQNAAQPAELSREFIMLMGEA*PG 

AFPRRKIARGLTEGPGHPNK*PGPG 

GGAPP 

1488 

6985 

A 

1600 

411 

1259 

SQGTTSRGSWEFPHSPEIEETSCLAE 

LFEKAAAHLQGLIQVAKQGATLCT 

LYAKYKQVKVGNCNTPKPSFFDFE 

GKQKWEAWKALGDSSPH\QAMQE 

YIAWIKLDPGW/ISSDIQRRNGKEA 

NTGFGGPV1SSLYHEETIREEDKNIF 

DYCRGKQH*PYNQKPSNPKNVDVN 

VKDEEGRAP\LHWGL*SEDIKELVH 

SVAANIELTLNCQD\NERPKQALHY 

ASACGVSGIL*ELAAPSLGADPDSPR 

PGWLPARGGDRLQNSFFGAAAAHN 

WQGLIKRLENCSL 

1489 

6986 

A 

1601 

177 

409 

FLQASGILKGFEPNLL\NLVLTVTI*Y 
MRDPDDQYKLTGGHPGKLGLWFR 
G\TSLVL\ICPQDGMEAIP\NPFIQQQ 
DA 

1490 

6987 

A 

1602 

1 

165 

PLKRSDGCNDGRPTRPPTRPDTTVF 

TSNLKQTRMVHLTPVE\KSAVTAL 

WGKVNVDEWGGKALGRLLVVLP 

WDPKRSFQSPLGESVPTP*MVHLTP 

VERVCRYCPVGQGERG 

1491 

6988 

A 

1603 

240 

461 


1492 

6989 

A 

1604 

2 

206 


1493 

6990 

A 

1605 

2940 

3296 


1494 

6991 

A 

1606 

189 

736 

ENKISSVFKADFLPPAPCSLPGLEVS 

VSPKGKNTSGRESGFGWAIWMEGL 

VFSRLSPEYYELA\RPHLRDEEKS\CP 

C\LAQE\GPQG\DLLTKTPELGP*ITR 

TC\LTI\VQKT*RKMVDKP\TQRSVS 

NAATRVCRTGRSRWRDVCRNFMR 

RYQSRVTQGLV\AG\ELAQQNLVST 

SRLC1PSTGPL 

1495 

6992 

A 

1607 

3 

452 


1496 

6993 

A 

1608 

3 

485 

PTLLVPTDSERTHPWLLSPADKYTN 

VKGPPGGKVGAHAVRSMCAEALE 

RMFLSFPTTKTYFPHF\DLSHG\SAQ 

V*GPRARKVADAL\TNAVAVHVGRT 

LPNALVPPLSDLHAHKL\RVGPGSTF 

KLLKATCLAGLTLXAAHLPARVQPL 

AVASLPWDKVSWASC 



1497 

6994 

A 

1612 

77 

636 

QPQTDTMVHLTPVEVKSAVTALWG\

KVNLDE\VGGKALGRLL\VV\YPWD 

PQKVL*NPLGESVPTP*MLVKWGKP 

PKGEKAQWQKKKCSGAL*VNGPGL 

T/HGQPSKGTFGPH*SE\LHC\DKLH\ 

VDP\ENFRLPGQQCLVCVAGPITLG 

KEFTP\QLQACLFRKLVA\GVANAL 

GPTSNHLSLAFLAGP1SN 

1498 

6995 

C 

1613 

167 

391 

MNVFMCRLGTTFHLVLLLPSVLPSL 
RKTVFLNPFSIKQRFQRWKHWVFQ 
VASELTDAILSSCGHLFLPGSHNLS*

1499 

6996 

A 

1614 

1402 

1871 

GLQGSQSLHIPSLTGLRHAC1TLGKT 

AHSSRLHPSPAPPPYL*STDTRDNN

APEPTPPRSWTWRA*/PMGRGSSQE 

GQASQQPWPGEGKSGCEMPP\LVY 

KVKPEP*P/SPDPWGL*QSMPLDYL 

HLSVILRWRRGGGQWQGATKISRR 

DRRGGALLLHL 

1500 

"6997 

A 

1615 

8 

551 

SAQMAVTTADPRVRPRVRTQLCSL 

ASLIQTLLVHLTPVE\KSAVTALWG 

KVNVDEWGGEDLSNRLLPVYPWTQ

TFFD\SFADQSTPDAAMGNPKVKAH 

S\KKVLG\AFSGGPGCTWDNLKGTF 

AHTEVSLHC*QACTWDP\ENFRLLG\ 

NVLVCCCWAHSLLGKEFQPHQLQA 

CLIKKIGWLGVG 

1501 

6998 

A 

1616 

3 

389 


1502 

6999 

A 

1617 

1 

672 


1503 

7000 

A 

1618 

18 

621 

RSLRCSRHSCLATSSPLPCARRAWH 

PARGKADQPFCRSAGPSVPAAQPR 

GENREKEETTRIGPGVMESKEKRAV 

NSLSMENANQENEEKEQVANKGEP 

LALPLDAGEYCVPRGNRR\RFRVRQ 

PILQYRWDMMHR\LGEPTGQGMKR 

RE*WKRIGEEVRQLMEKSWRGKSQ 

L\SH\SLRGESGTDPPSPMTHHD*VF 

ALMPLNP 

1504 

7001 

A 

1621 

3 

700 

HASDRRHHGSHACSRVSSGHQAGL 

LGGGWEEDRECGQRAEGMMFWA 

ALALAATSPSRLLLSPGKGPVPSRLL 

PLSDAASPTWLKLTI*RR*RSQIYQT 

G\QEGPLLPSQIGVI\LRDSHGVA\QV 

RFWTGNKIL\RILKS*GTCS*IFLIDL 

YHLN*ESQFAVRKGILRREQERDKG 

G*NFPSDF*IGEAGFHPFWLRYYKD 

QAESSLPNWEIWNHLTASALGR1NL 

VWCTPSN 

1505 

7002 

A 

1622 

1 

340 

GEHSMAPPAHFRALLYHPGTATLV 
PHPASISQHSPSPWGNA/RG*PV*RQ 
RHLTAPRSPPHPRFRHKPGKDPREN 
PSRWPEVPSLPQTHVVPGQAAWDT 
VNTTVCKNRSTKPQD 

1506 

7003 

A 

1623 

3 

1076 

HPVPSSSYSVHTLSPAAMTEQMTLR 
GTLKGHNG\WVTQ\IATTPQFPENNL
PASRVEKAIILRKPARGWSQPMNFQ 
RRSRRIHSHLLRMVVIS*DGQFC/AF 
QGFWDWKPCALGDLTNG/TPTRGR 


FCGPIPKDVLSVAFSSDNRQRLSLGS 

RE*THQSLWN\TLG\VCKFNLSRNES 

H/SESWVLCV\RFLAPTTSKPLSIGLP 

VAWDK\LVQV\WKPGLNCKADAPN 

PHLATTGFUNTGDLSSSRWDPLCA 

SGG\KDGQA\MLWDL\NEGQPPLQR 

*NGGDI\INA\LCFSPN\RY\WLC\AAT 

GPSMKIWDLEGK1IWDELKQEVISTS 

SKAEPPQCTSLAWSADGQTLFAGY 

TDNLVRVWQVTIGTR 

1507 

7004 

A 

1624 

1 

1189 

LQGGGRRGCGASFSKPSSAILVAAA 

THALAAAMTEQMTLRGTLKGHNG\ 

WVTQ\IATTPQFPENNLPASR\EKAII 

LRKPARGWSQPMNFQRRSRRIHSHF 

V\SDVVIS*DGQFC/AFQGFWDWKP 

CALGDLTNGHPHEGDFVGPYPRNV 

LSVA\LSSDNRQRLSLGSRE*THQSL 

W\NTPGWCANTTVPG*EPTQEWV\S 

CVPLPRPNNQQTPIIVLLWPGNNLV 

QVMETWANCKLKDPT/NHWPTPGY 

SETPVTVLSRMDPFCA\SGGQGMAQ 

AHVYGDL\NEG\KHLLHARMVGTSI 

NA\LCFSP*PATWLCAATGPS\IK1W 

GFRGERSIVDELKQEVISTSSKAEPP 

QCTSLAWSADGQTLFAGYTDNLVR 

VWQVTIGTRLEVYGRALPIKKKTGF 

SEKKKK 

1508 

7005 

A 

1625 

3 

445 

GEFADSF/SSMGSPVNAQDFCTDLA 

VSSANFIPTVTAISTSPDLQWLVQPA 

LVSSVAPSQTRAPHPFGVPAPSSGA 

YSRAGVVKTMTGGRAQSIGRRGKV 

EQETDQLEDEKSALQTEIANLLKEK 

EKLEFILAAHRPACKNPDDLGFPE 

1509 

7006 

A 

1626 

7 

514 


1510 

7007 

A 

1627 

3 

462 

RRSERAVTVLLPSSASQRPPVSAPRP 

LARLCLTATMMFSGFNADYEASSS 

RCSSA\SPAGNSLSYYHSPRRPPFSA 

WGSPVNAQ\DFCTGPGPFSSANF\IP 

TGHLPSWTSPD\LQWLVQ\PALVSS 

VUPSQTRAPSTFSESPPPTAGA\YSR 

AGVVKTMTGGRAKSIG\RRGKVEQ 

LSPEEEEKRRIRRERN\KMAAAKC\R 

NR\RRELTDTLQA\ETEQL*DERTAF 

WT\RMSHPVEEEGKLEFILAAHR\PA 

\CKIPDDLGLPRKRMSVASLDLTGG 

LPRGLPPRRSEEAFTL\PLLNDP*/DP 

KPSVEPVK\SISSMELKTEPFDDFLFP 

ASSRPSGSETARSVPDMDLSGSFYA 

ADWEPLHSGSLGMGAHGHRGWEP 

LCTPVVTCTPSCNCLHVFLRLHLPR 

G*LLPQLCSCPPQGQQQQ*AFL*LA

QLTHAAGPVRGQGRGGSRHPQVXPL 

PELVHYREEKHIFP*RVPRPGIASLT 

TTHPADLLFQHGARLSTRRDFCTGP 

GPFSSANFHSHGSLAILDQSGPCSG 

WCKPALVSSVGPIADQSPFNLFGVP 

TPYRWGLAPGLAL 

1511 

7008 

B 

1628 

43 

674 

MDWTWSILFLVAATTGVHSQVHLV 


QSGAEVKKPGASVKVSCKASFNSF 

DTYGFNWVRQAPGQGLEWMGWV 

SAFNGDTNYIRKLQGRVTMTTDSST 

STAYLELRSLKSDDSAIYYCAATNS 

DKYFWGQGTLVTVSAASPTSPKVF 

PLSLCSTQPDGNVVIACLVQGFFPQ 

EPLSVTWSESGQGVTARNFPPSQDA 

FGDLYTTSSQLTLPATH* 

1512 

7009 

A 

1629 

3 

1639 

SPGIFRGFQSVIRTEQRELTMESGLN 

WLLLVAVLKGVQCEVQILESGGGQ 

VQPGGSRTLSCAASGFIFSNYVMTW 

VRQAPGKGLEWVSSTAASGANTFY 

AESVKGRFTVSRENSENMMYLQMS 

SLRDEDTGIYYCAKDGDVPNLGVA 

WIVAGPGNVRPRKWFDAWGQGTT 

VTVSSASPTSPKVFPLSLCSTQPDGN 

VV1A\SCLVQGFFPQEPLSVTWSESG 

QGVTARNFPPSQ\MASGDLYTTSSQ 

LTLPATQCLAVPKSVTCHVKHYTNP 

HPDVDGVPCPVPSTPPTP/CSLNSTYP 

ISLMLPPPTVTAPTGPSKDLFLGSEA 

NLTCTLTGLE\NASGCHFQSEGLQV 

GKSAVQGP\PEA*PSVAAYSVVQLS 

CRGWREAMEPLVRPFTCTAA\HPV 

VQGPALTAHPLQNPGNTFPGPRVHP 

FAPAVGRVCPFNDLLT\LHCLAR\AF 

SPQGPCWVRWLQGSPKLPPRKST*L 

G/PFPAGAQARAPTTFAVTSILGR/V 

QPEDWEE/EGTPFSCMAGH\EALAL 

AFTQKTIDRLARKPTHVNVNSAVMP 

EVDGTCY 

1513 

7010 

A 

1630 

3 

497 

SSGPTRLRENHPWLLSPADKTTVKG 

PL/WGKVGAHAAEYG\SEALGEGFS 

LSFPQPPKTYFPATSDLE/HNGFAPG 

LKGHGQRKFGRTR*PKSRGGNVD\T> 

MPQTALSAPERPCTAHKL\RVDPV\ 

NFQASLSHC/LCLVTLAAHLPAEF\T 

PAVRRLLWSKFLASWSTVL 

1514 

7011 

A 

1631 

9 

489 

NSARATDSERTHHGARLLPDKTKA 

QRPPRLKLGANA\GEYG\SEAL\ERM 

FLSFPNPPKTYFRQFRP*ANGFAQG* 

RGHGQRRWPDALTQ\AVA\HVDEH 

APNGAVRP*SDLH\AHKL\RVDPVN 

FQASLSHCLAW*PWPAHLPRPSSTP 

GGATPSLEQSSWASC 

1515 

7012 

A 

1632 

2291 

2960 

INCPAQAKVADILQFNFKKFVCLF/D 

FLRQSLALSPRLQVQWNYDLSSLQSP 

SPRFKQFSCLRLPSS*NYRCASPRPT 

NFLIF/M*RWGFTMLARLVFVLLTSS 

DPPTSA/SHSAGITSVSHCTRPLQSIFI 

*PLEQVS*VKDKNNNKKTHFFVLFC 

FLRQSHSWTQAGMQWHDQSSLLL 

QPPRLKQFSHFRLLSSWYYRCLANF 

*IFCKDGVLLCCPW*SQTPGLK 

1516 

7013 

A 

1633 

1 

1233 


1517 

7014 

A 

1634 

233 

884 

ESPGVGCSARRGPRPRSPGPPPAAP 

GTPRPHGIPLYTRAGHQ**GEIRRRP 

CTFISKFLRPQGGSASERQLPDLQAR 


AWQELLGRPFNKHXHWFPR*SPCKG
IGVTRCI\RTNP*KWIPLIGPGQHSAIG 
LSSQELFRLLP\SEL\TLWG*PPIEVSY 
RIGEDGSHLCACMKPSPA\GGST\QN 
QT\NVQMV\DS\RISCKEELLLGRTEP 
FPKTTNMMTVSG 

1518 

7015 

A 

1635 

2 

402 

SQTQREPTMVLSPADKTNVKAA/W 

GMFLSFPTTKTYFPHFDLSHGSAQV 

KGHGKKVADALTNAVAHVDDMPN 

ALSALSDLHAHKLRVDPVNFKLLS 

HCLLVTLAAHLPAEFTPAVHASLDK 

FLASVSTVLTSKYR 

1519 

7016 

A 

1636 

2 

522 

GLEFGTSHRLRENPPWCLSPADKTN 

VKA\AWGKVGAHAGEYGAEALER 

MFLSFP\TTKTYFPHFD\LSHG\SAQG 

♦RAHGK\KVA\DALTK\AVA\HVDD 

MP/HNGAVPPLSDLHGAQAFGWDP 

VQLQSS*SHLPCW*PLARPPSPAEFQ 

PLAVATSSLGQSFLGFLLKHRCLNL 

PNYR 

1520 

7017 

A 

1637 

344 

742 

GFLIGVNEKTCFFTSPMLHDSYFFFL 

VNVIRCHFICGTYLYWAKHIFSFVPF 

FLSFLFTSFISLFLPFPFFLFFFFFFW* 

LLLPTPFYVSF\MKG*SFNF*FF1FKC 

RLLTLLQNIK*TREMTTFDYFLSVFL 

1521 

7018 

A 

1638 

1 

519 

PLKRSDGCNDGRPTRPPTRADTTAY 

TSNLKQTLLVHLTVTEEKSAVTALW 

GKVNVNEKVGGKALGRLLVVYPWT 

QRF\FESFGDLSTPDAVMGNPKVKA 

HSQESSPRGL*WWAWLTWDNLKG 

TFAHTEVSLHCDKXLHRGSLKNFRL 

LGQRAWSVVAGPIHFWQKNFNPTS 

CRLA 

1522 

7019 

A 

1639 

3 

452 


1523 

7020 

A 

1640 

3 

484 

PTLLVPTDSERTHPWLLSPADKDQR 

QGPAWGKVGAHAVRSMCAEALER 

MFLSFPTTKTYFPHFDLSHG\SAQV* 

GPRARKVADALD/TNAVANVGRTL 

PNALVRPL\SDLHAHKL\RV\DPVQF 

SSFL\SHCLLG*PWAVHLPRPSFNPW 

RLQGFLGDKVSWAFC 

1524 

7021 

A 

1641 

180 

613 

SFAGISNGLAGRSVKDSGKAQ\AKA 
VSR\SQRAGLQSQWGR\INRH*KSRD 
AS\HERGGATA\AVYSAUILE\YLPQ 
KVLELAGKASKDFKGKAYYPLRHL 
Q\LAIRG\DEE\LDSLIK\A1MAGGGVI 
PTTSHQISDRGGKKKDNQKTV 

1525 

7022 

A 

1642 

107 

368 

IY1ILRD*VLSTFVCFILCKA1YKNIW 
TAFWKCS*ILICSI/LCNYVCTCTSVY 
ALCYIYIIDLR*QQTYLCESKCTCIC 
MYVCIFLC 

1526 

7023 

A 

1643 

790 

1252 

CAKPETQNNGNLRVRLRPLHFGHT 

LN*VRT*LKRR1FFFLRQSLALSPRV 

ECSGMISAHCKFCFLGSGHSPASAS* 

VAGTTARRQHAWLFLCVFSRDEFH 

RISKDGLNLL/NLVICPPRPPKVLGL 

QHEPPCPAKRRNFLSKIMGGHCFEL 


SSCN 

1527 

7024 

C 

1645 

171 

329 

MGARASGRSPLCQVGRQEQGLR.GR
NGPLPASRLPQQPQVVPATPQSPRD

DLA* 

1528 

7025 

B 

1646 

196 

514 

MGRDTGPELCLHDSSLPAADDGPSL 

PTKQNEEFRPFIRRLPEFKFWHAAT 

KGILVAMVCTFFDAFNVPVFWPILV 

MYFIMLFCITMKRQIKHMIKYRYIPF 

THGKRX* 

1529 

7026 

A 

1647 

43 

321 


1530 

7027 

A 

1648 

35 

821 

GRPRLGRSGAASYRMSEGDSVGES 

VHGKPSVVYRFFTRLGQ1YQSWLD 

KSTPYTAVRWVVTLGLSFVYMIRV 

YLLQGWYIVTYALGIYHLNLFIAFL 

SPKVDPSLMEDSDDGPSLPTKQNEE 

FRPFIRRLPEFKFWDASVCGDGRCS 

CKAGGGRQCPVLAADAALTFSPHL 

K\HAATKGILVAMVCTFFDAFNVPV 

FWPILVMYFIMLFCITMKRQIKHMI 

KYRYIPFTHGKRRYRGKEDAGKAF 

GQLEAGLRLPHVLQEQF 

1531 

7028 

A 

1649 

156 

818 

HSYRMSEGDSVGESVHGKPSVWYR 

FFTVRLGQIYQSWLDKSTPYTAVRVW 

VVTLG\LSFVYIDSEFTCLQGWYIC* 

PMPLGIVYHLNLFHKLSFSPKWDPSL 

M\EGLQDDGSFGYPTKQERREF/RA 

PFIRKGSPGVLNFWHAGYPRASLVA 

LWSCTF\FDAFQTSPVFW\PIL\VMYF 

HHASSCITDGRRANSRHMDLSYRY 

MPVSHMGK/RGRYRGKEDVAGKAF 

AS 

1532 

7029 

A 

1650 

154 

685 

PPLHLRDCFSPPGRALSPVGFYPYR\ 
RSVP\TWLKLT\SDDVKE\QIYKLGQ 
EGPLLLSQIG\VILRDS\HG\VAQVRL 
G\TGHDTFKI\LKSKGLAPDLPEDLY 
HLIK\KA\VAVRKH\LERNRKD*GC* 
NSRLILIESRISPFWLRYYKYTKRWL 
PPNWEIWNHLTASAPGRINLVWCT 
QAIK 

1533 

7030 

C 

1651 

127 

435 

MAASXNPEVLDITEETLHSRFLEGV 

RNVASVCLQIGYPTXASVPHSIINGY 

KRVLALSVETDYTFPLAEKVKAFLA 

DPSAFVAAAXLGCCHHSCSXCCCSP 

S* 

1534 

7031 

A 

1652 

1 

689 

KCFI/VGADNVASKQMQQIRMSFRG 

KAVC*WGKNTMMRKPIRGHLENNP

ALEKLLPHIRGNVGFVFTKEDLTEIR 

DMLLANKVPAAARAGAIAPCEVTV 

PAQNTGLGPEKTSFFQALGITTKISR 

GTIEILGVRNVASVCLQIGYPTVASV 

PHSIINGYKRVLALSVETDYTFPLAE 

KVKAFLADPSAF/VAAAP/VAAATT 

AAPRAAAAPAKVEAKEESEESDED 

MGFGLFD 

1535 

7032 

A 

1653 

68 

1110 

RTAVMPREDRATWKSNYFLKIIQLL 
DDYPKCFIVG\ADNVGSKQMQQIRI
VPWGEACVLMGQKTMVMGQAHPK 


GTL*NNPSLWRKLLP\HIRG\NLGFC 

FTQGGPSLEIKGHVCLANKGLPSWL 

PVVGANCPHGEVTGGQPQNTG/LSG 

PEKTSFFPGL*AITTK\ISQGAPIENPE 

VNVPAESRTGDQSGKPSEANAA«TC 

SNISPFXSFGAGSSQPGVSTNGSHLPP 

LKGLDIHRRNLCIFWLSWRGVRNV 

AKCLSARLAYPTVA\SVP\HSIINGY 

KRVLA\LSV\EPDYTFPLAEKVKAFL 

\ADPSAFVVAA\PVG\VAPPAAPGG\ 

VAAPAKVEAKEESEESDEDMGFGL 

FD 

1536 

7033 

A 

1654 

1 

364 


1537 

7034 

A 

1655 

70 

253 


1538 

7035 

A 

1656 

151 

657 

APTTPTGPGDPLDLVALAEQVQKA 

DEFIRANATNKLTVIA\DQLQ\HLPE 

QARKVLEEAHRDANLHHVACNIVK 

KPGDIYYLL*TGRVGQQYFSIISPKE 

LGDKFVHIDFLGAYKLQHDLSWTP 

V*GH*EGKMLKISMMGHVAKPVSG 

PASVHWNPTFQGTDSLEEWGF 

1539 

7036 

A 

1657 

345 

547 

GFKPPDFFLCNENKWRKTNTSSFSE 

Y*CLMHIHLLIFNITIFSS/LHTYIHKH 

THTHTPFSVFIMEGC 

1540 

7037 

A 

1658 

228 

900 

PSQAGNTSPSGARSSFPKDMKLLEN 

SSFEAINSQLTVETGDAHI1GRIESYS 

CKMAGDDKHMFKQFCQEGQPHVL 

EALSPPQTSGLSPSRLSKSQGGEEEG 

PLSDKCSRKTLFYLIATLNESFQA*L 

*LQHSPQPLSSAGSPALSWLG*MQS 

TAVCSQLCGEDFKDLKPQLWNAVG 

RGDLPGLKCDIYS\YNPYLDSDP\FG 

EDGSLWSFNYFFYNKRLKRNRLL 

1541 

7038 

A 

1659 

35 

1288 


1542 

7039 

A 

1660 

1 

1641 


1543 

7040 

A 

1661 

212 

369 

HPVTVYLLLGYLLFQLPCGSEFSTSE 
THGHSADRLG\AAFAVSRLEQDEYA 
PG 

1544 

7041 

A 

1662 

63 

255 

VLMFSSSHG*GYQSS\RLQCKLQIVQ 

LIQDILLFFSF*IPE*LLS*LTPLKIFPL 

HQNGPSDFVS 

1545 

7042 

A 

1663 

169 

391 


1546 

7043 

A 

1664 

85 

1534 

KSSHCIKMGPQIFHKTSELVLPATSC 

PSCPDQNEEDVSQTQYKECCG\GG 

WCSHSIFAVWVHFI*RPDAT*FGLEQ 

RLTGLLASGPVSLREVV*LYSSLGT 

VISGK*KTSNVG*RGLALGSWAFSD 

KYSWFTMFTWACISGPTKALXTTGV 

\GLIAF\GQCDVrVAGGVELMSDVPI 

RHSRKMIKLMLDLNKAKSMGQRLS 

LISKFRFNFLAPELPAVSEFSTSETM 

GHSADRLAAAFAVSFL\DQDEYALR 

SHSLSKKAQDEGLLSDVVPFKVPGK 

DTVTKDNGIRPSSLEQMAKLKPAF1 

KPYGTVTAANSSF/LLTDGASAMLI 

MAEEKALAMGYKPKAYL/RRDFM 

YVSQDPKDQLLLGPTYATPKVLEK 


AGL\TMNDIDAFEFHEAFSG\QILAN 
F\KPMDSDWFAEVNYMG*KKPRFGL 
PPLWRRFNNWG\GSLS\LGHPFGT\T 
GCR\LV\MTAANRLRKKGGQYGL\V 
AACAPG\GQGSATDYVEAYPK 

1547

7044 

A 

1665 

294 

823 


1548 

7045 

A 

1666 

3 

1171 


1549 

7046 

A 

1667 

1419 

1801 

TMEIHPIEQLDPSDHLESTAAGQEA 

LFTYHSF*STFISFFETGPHFVT\RLK 

CSSAIITHYSVELLGSSHPPTSASWV 

AGNTGVCPHVQLIFLFFVEMGSHY 

VARLVFNS\GLV1FLPWLPKVLGLQ 

V 

1550 

7047 

B 

1668 

68 

471 

MVRKL1VPRAEAAEAGGLPELGGH 

RPQPARAARAAALTGCSGGEDYTR 

YNQLSRAVPVCSRLGAHARVRWEL 

CDFVTASSFCRRRLPTVLLKLRMAQ 

HLQGSIALL* 

1551 

7048 

A 

1669 

2 

359 


1552 

7049 

A 

1670 

1 

585 

PRGVIGHGPLGTSFIGKYGCGDYW 

VKAFLDRPSQPNXQGPKKNFEWWD 

LVDVNTP\DLMA\PVSAKKERKVSC 

MFIPDGRVSVSARIDRKGFCEGDE1S 

IHADFENTCSRJVVPKAAIVARHTY 

LANGQTKVLTQKLSSVRGNHIISGT 

CASWRGKSLRVQKIRPSILGCNILR 

VEYSLLIYVSVPGSKQVFIKAL 

1553 

7050 

B 

1672 

21 

410 

MPSKVRCXSVQVFDAMKTATAVA 

HCKRGNGLIKLLEPVLLLGKERFAG 

VDTRVRVKGGGHVAQIYAIRQSISK 

ALVAYYQKYVDEASKKEIKD1LIQY 

DRTLLVADPRRCESKKFGGPGARA 

RYQKSYR* 

1554 

7051 

A 

1673 

1 

456 

MPS/KGPLQSVQVFGRKKTATAVA 

H/CKRGNGLIKVNGRPLEM/IEPRTL 

QYKVLGSGTGVSGWRTLGDRDW 

ALESWGAGISNGMFRSCVGCRQWA 

AGASSASRQERFAGVDIRVRVKGG 

GPWPRFMSKKFGGPGARARYQKST 

DKPIVTQNSLV 

1555 

7052 

A 

1674 

172 

661 

LLEPVLLLGKERFAGVDIRVRVKGG 

GHVAQIYGESQELGAWRRWLWEG 

GLHSAPVPFNCVSFSQLSVSPIS\KAL 

VAYYQK\WSEHGSFP*GRWVCGDQ 

VKDSV*LSKSSSLLFLPDVDEASKK 

EIKDILAQYDRTLLVADPRRCESKK 

FGGPGARARYQKSYR 

1556 

7053 

A 

1675 

27 

554 

STLGAMPSK/GVPLQSVQ\VFG\RKK 

DSGQLLAH\CKRAVNGLIQG*TGGPL 

EMIEARARLQYK\LLEP\VLLLGK\E 

RFAG\VDIPCPV*KGGWSTWPQIYAI 

RQS\ISQKPLVAYYPEM*VSMGPSH 

EYYVDEAFQRREIKDILHPSYVDRNP

AGLAGPFVRCE\SKKF\GGPGA\RAR 

YQKSYR 

1557 

7054 

A 

1676 

192 

836 

ALIMSFIFEW1YNGFSSVLQFLGLYK 
KSGKLVFLNGLDNAGKTRLLHMLK 


DD\RLGQHVPT\LHLT\SEELT\IAGM 

T\FTTFDLVGHE\QARRVWKN\YLPS 

QLMGFVFLVDCA\DHFSPSWNPKVE 

LNALMTE*NNIPMCPILILG\NKVIDR 

TDAISEEKLREIFGLYGTGPQERGEL 

*PLKEL\NARPM\EVFHVAVLLKEG 

KVYGRGFSAGLLPVLFD 

1558 

7055 

A 

1677 

196 

520 

DTVSRKNKSGKIFQLSSRV'IYERSQ 

SGVKVYKCKTFGKAFTQ/HF*AHM 

RMYTGEKPYKY*ECGKFFILVLLLL 

LMIQKYFHLIKIVRLYLIRKKVSCKQ 

PSNKILQS 

1559 

7056 

B 

1678 

1 

1521 

MG1RVTSV1VSRPVPHSEAVFCCWL 

LGATDVWIPEHPANPRLTFPLFPESP 

GRHLREIKLQSARDASVKSAKNTR 

VIPKPQRVIHPGGQPTDKMDLDLLD 

LNPRIIAAIKKAKLKSVKEVLHFSGP 

DLKRLTNLSSPEVWHLLRTASLHLR 

GSSILTALQLHQQKERFPTQHQRLS 

LGCPVLDALLRGGLPLDGITELAGR 

SSAGKTQLALQLCLAVQFPRQHGG 

LEAGAVYICTEDAFPHKRLQQLMA 

QQPRLRTDVPGELLQKLRFGSQIF1E 

HVADVDTLLECVNKKVPVLLSRGM 

ARLVVIDSVAAPFRCEFDSQASAPR 

ARHLQSLGAMLRELSSAFQSPVLCI 

NQVTEAMEEQGAAHGPLGFWDER 

VSPALGITWANQLLVRLLADRLREE 

EAALGCPARTLRVLSAPHLPPSSCS 

YT1SAEGSWGTNCKQNTRLH1SPET 

AGPAAHAACWPQQDTVRAGHSES 

WHASCCNPDTDMQGQTISTSVNQQ 

QEAQAKPPPTPL* 

1560 

7057 

A 

1679 

991 

1367 

AVLVFNNNGEANEGSGPRGP*GERS 

SRARPP/SGPGPWNCAPPRPWCPPL 

RGWSSVSWD*TAQAKPVCKSP/AG 

GSSPGTGSPSAPSPPGAGTEPAWAG 

PAELPGVFSLLNVPLSLSCLIF*SLAC

LA 

1561 

7058 

A 

1680 

313 

429 

CIESMVHGGENIFPAGHGGSHL*SQ 
HFGRPRQVDHLRSG 

1562 

7059 

A 

1681 

552 

792 

GSASDYQSGIRTVGPRDWLCRRRA 
LDLDAARTQSVRAAEGKCAFLQMQ 
GPRVYTGPGRPRRADHLRSGV*DQ
PGQHGETP 

1563 

7060 

A 

1682 

508 

1085 

CQHFGRPRRADHLRS\GVRAQPGQ

HGETPSLLKILKLA/GHGGAPL*SQL

LGRLRQENHLNPGGRGCGEPRSHH

CTPAWETERDSISKKKTKTQVVICI*

SLNLVREIKNKIGLTAE*ILQKNSEL

EDVSIEII*NEGQRDGKKMDRAFLRS

MQQYQAVQYMCNQSPRRVGDRM

GRINSQKKCKTYTMNHKHC

1564 

7061 

C 

1683 

46 

516 

MLSDPPARIRTRKGPTETVSRIPRPN 
SPNGQGPVDSSPSGXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXX

XXXXXXXXXXXXXXXXXXXXXX 


XXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXX*

1565 

7062 

A 

1684 

2 

100 


1566 

7063 

A 

1685 

2 

814 

PGYKKGPQETGSRIPRPNSPNPLRVL 

WTAHLVAMAPGSRTSL\LLAFALLC 

LALGFKEAGCPSQTVP\LSRLF\DHG 

MVQGPSRVTSCAIDYTYQGFEETYIP 

KDQKYSFLHDSQTSFCFS\DSIPT\PS 

NMEGSATRNPILELLRI\SLLLIE\SW 

LEPV\RFLRSIVPPTTWVYEHLGTAI 

DYHLLAKDLEGGHPTV*WGRLEDG

KPPDLGKILKQTYSKFDTNSHNHDA 

LLK\NYGLLYCFRKDM\DKVE\TFLR 

MVQCRXSVEGSCWLLGCPSSIL 

1567 

7064 

A 

1686 

3 

452 


1568 

7065 

A 

1687 

3 

516 

PTLLVPTDSERTHHGACLLPDKTNV 

\KAAWG\KVGAHAG\EYGAEALER 

MFLSFPT\TKTYFPHFDL\SHGFCPGL 

RATGKKVADALTKRRG/HTWDDM 

PKRRCPP*SDLHAHKL\RV\DPVQLS 

SS*SHLPCWVTPGPAHLPRPSSTPGG 

CKASLGQSFLGFLLKHRCLNLPNNV 

1569 

7066 

A 

1688 

3 

409 

SNFRSNFGYMPLKHLADRVAMYV 

HAYTLYSAVRPFGC/SGYWGCAIGK 

ARQAAKTEIEKLQMKEMTCRDIVK 

EVAKIIYIVHDEVKDKAFELELSWV 

GELTNGRHEIVPKDIREEAEKYAKE 

SLKEEDESDDDNM 

1570 

7067 

A 

1689 

2 

437 


1571 

7068 

A 

1690 

126 

409 

ILLLWMDILICTDFGSVNYFNVWRL
PKSYLSLFYSRIYIVHDEVKDKAFEL 
ELSWVG\ELTNGRHEIVPKDIREEAE 
KYAKESLKEEDESDDDNM 

1572 

7069 

A 

1691 

516 

564 


1573 

7070 

A 

1692 

224 

344 

ILLLGFLVLASDHLQSKYAL*CPLR
HLPELNPSLREGSVL 

1574 

7071 

A 

1693 

1 

1237 

MGCRPVGQAGLELLTSARTCFVSD 

LKRGLKIQAAKFNIDGNNECPIDTR 

KQLAENLVVIGGTSMLPGFLHRLLA 

EIRYLVEKPKYKKALGTKTFRIHTPP 

AKANCVAWLGARSLLGYNRFLMF 

QPFRGEETVWSLLPKIQAYCCPFLL 

KYDLSASTFSPDGRVFQVEYAMKA 

VENSSTAIGIRCKDGVVLGV\EK*VL 

SKLYERRVPNKKTFLMFDRNHVGM 

A\VAG\LLADA\RSLADIAREEASNF 

RSNFGNYTIPLKHLADRVAHVWCHA 

\YTTLPVAVRPF\GCQFPC*GPYSVN\ 

DGA\QLYHD*PHPGVSIPVNWGCAI 

GQRPGQAWQRRKLEKNLQMKEMTV 

C\RDIVKEVAKIIYIVHDEVKDKAF\E 

LEPSW\VGELTK\GRHEI\VPK\DIREE 

AEKYAKESLKEEDESDDDNV 

1575 

7072 

A 

1694 

1 

1083 


1576

7073

A

1695

138 

545 

RPGMWSTRSPNSTAWPLSLEPDPG 
MASASTTMHTTTIAEPDPGVS\GLP\ 
DGRMETPTPHP*LTMVVMAGCDV/ 


AAVPIVLVSLLFWMLRYMYRHKGT 

YHIANEAKGTEFAESADAALQGXDP 

AL\QDAGDSSRK\EYFI 

1577 

7074 

A 

1696 

2 

498 


1578 

7075 

A 

1697 

41 

510 

APSPRRPWGHFTEEDQGLLSTSLWG 

KGEMWKKCWEGRNPWERLPGCPT 

PWTPRGSFEQLWQPCPSA/ILAHPW 

ANPQSPRHHGK\KV)LTSLGRCP*STL 

DDLKGTFAQL\SELHCDKLHVDPEN 

FKLLG\NVLVTV\LAIHF\GKDFTPGG 

CRASWAEDG 

1579 

7076 

A 

1701 

153 

744 

AVNLVPSKDRHLTQSRSQGGGVAN 

PNSGVYSARPSSPPPQIALPAWGTG 

QPQTLQTSPGEGFSVQTGAEVGVG 

RG*PRRRPPRARQGKGVHSTLKKL

KHYRFHIADGLDRGQAPPLGRPLFL 

GPGWLRTH*GRAGK/QNPQTVGDH 

PGAGAPPSSPALAVSL*/H/CTGLC*I 

PSPACSPGSPSPPRRGSHSPPDAWV 

DP 

1580 

7077 

A 

1702 

35 

569 


1581 

7078 

A 

1703 

509 

2455 

LPAATVLFTSSPNFQIQQPNRTNGD 

VTKKIHDSLESSKISTLKGNLERGYF 

QPSWMTLRGVRLQLEEVPADIVEIA 

RELELEVEPEDVTEFLQSHDQTLAD 

QKLLLMAEQRKWFLEMETTPGGD 

AMNIVEITINNLEYNINFVDKGAAG 

FER1DSNFERSSTAEWVTVKQTQAH 

PSGGIQEGIVIIRDGSRPYTTPEHLPV 

RPNVEEEDSDIDESSPFFLRNYYKA 

AHSFIGRIRFKHSTSLLREAFYHTSK 

EFFSAIRKLASSPEKGKGGIIFTAINP 

FTRSINEIYKIQRRRGKERQLNDCV 

HRSDDANKGPESLGSAGSGQSHDV 

AQGHLQGLVLGQLFILAPLGKFHPE 

EDVKQATSNFENLQKQLARKMKLP 

IFIADAFTARAFRGNPAAVCLLENE 

LDEDMHQKIAREMNLSETAFIRKLH 

PTDNFAQKNMNSTLTFVTLSGELRA 

RRAEDGIVLDLPLYPAHPQDFHEVE 

DLIKTAIGNTLVQDICYSPDTQKLLV 

RLSDVYNRSFLGEPGKLNTENLLQV 

ENTGKVKGLILTLKGEPGGQTQAFD 

FYSRYFAPWVGVAEDPVTGSAHAV 

LSSYWSQHLGKKEMHAFQ/SFPPSE 

ESWEFPFVQT*RVDIRGCAPVVLEG 

TLTAYRWLCCDAAVSNHQVFSA 

1582 

7079 

A 

1704 

1 

1503 


1583 

7080 

A 

1705 

1 

635 


1584 

7081 

A 

1706 

1 

804 

LQFSSALGGGRCRASASSPRRARRR 

GQRPRHPAPRRPQAARPSAAPRARR 

FLSQRPAAAAAAQRAALMQAIKC\ 

VVVGKPKL*GKT\CLLI\SYTTQCHF 

LGEYIPTVFDN\YSA\NVMVDGKTG 

EIWGLWDTAVQEDYDRVTPPYPYP 

A/QADVFLF\CFPFVSPASFENVRAK 

WYLNVRHHCPNNTPMILVGTKLDLR 

DDKDTIEKLKEKKLTVPITYPQGLA\ 


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MAKEIGAVKYL\ECSALTQRGLKTV 
FDEAIRAVLCPPPVKKRKRKCLLL 

1585 

7082 

A 

1707 

1 

848 

RPRVRAGAENMMFSAAARLSPWE 

GSPSFAENMNDWMPIAKVEYDPLKA 

GSIDGTDEDPHDRAVWRA\MLARY 

VPNKGVIGDPL\LTLFYARLNLQ\TK 

EG\K*KEV\FPRYGDIRRLRLVRDLV 

TGFSKGWAFIEYKEERAVIKAYRD 

ADGLVIDQ\HEIFVDYE\LERTLKGW 

IPRRL\GGGL\GGKKESG\QLEFGGR 

DR\PFRKP\1NLPWKNDLYREGNRE\ 

RRERSRSRERHWDSRTRDRDHDRG 

REKRWQEREPIRVWPDNDNWRRER 

DFRDDRIKGREKKERGK 

1586 

7083 

A 

1708 

3 

3067 


1587 

7084 

A 

1709 

148 

4435 

GIQRKYLKGSIMVSSGCRMRSLWFI
IVISFLPNTEGFSRAALPFGLVRRELS 
CEGYSIDLRCPGSDVIMIESANYGRT 
DDKICDADPFQMENTDCYLPDAFKI 
MTQRCNNRTQCIWTGSDVFPDPCP 
GTYKYLEVQYECVPYIFVCPGTLKA 
IVDSPCIYEAEQKAGAWCKDPLQA 
ADK1YFMPWTPYRTDTLIEYASLED 
FQNSRQTTTYKLPNRVDGTGFVVY 
DGAVFFNKERTRNIVKFDLRTRIKS 
GEAIINYANYHDTSPYRWGGKTDID 
LAVDENGLWV1YATEQNNGMIVIS 
QLNPYTLRFEATWETVYDKRAASN 
AFMICGVLYVVRSVYQDNESETGK 
NSIDYIYNTRLNRGEYVDVPFPNQY 
QYIAAVDYNPRDNQLYVWNNNFIL 
RYSLEFGPPDPAQVPTTAVTITSSAE 
LFKTIISTTSTTSQKGPMSTTVAGSQ 
EGSKGTKPPPAVSTTKIPP1TNIFPLP 
ERFCEALDSKGIKWPQTQRGMMVE 
RPCPKGTRGTASYLCMISTGTWNPK 
GPDLSNCTSHWVNQLAQKIRSGEN
AASLANELAKHTKGPVFAGDVSSS 
VRLMEQLVDILDAQLQELKPSEKDS
AGRSYNKAIVDTVDNLLRPEALES 
WKHMNSSEQAHTATMLLDTLEEG 
AFVLADNLLEPTRVSMPTENIVLEV 
AVLSTEGQIQDFKFPLGIKGAGSSIQ 
LSANTVKQNSRNGLAKLVFIIYRSL 
GQFLSTENATIKLGADFIGRNSTIAV 
NSHVISVSINKESSRVYLTDPVLFTL 
PHIDPDNYFNANCSFWNYSERTMM 
GYWSTQGCKLVDTNKTRTTCACSH 
LTNFAILMAHREIAYKDGVHELLLT 
VITWVGIVISLVCLAICIFTFCFFRGL 
QSDRNTIHKNLCINLFIAEFIFLIGID 
KTKYAIACPIFAGLLHFFFLAAFAW 
MCLEGVQLYLMLVEVFESEYSRKK 
YYYVAGYLFPATWGVSAAIDYKS 
YGTEKACWLHVDNYFIWSFIGPVTF 
IILLNIIFLVITLCKMVKHSNTLKPDS 
SRLENIKSW\VLGAFALLCLLGLTW\ 
SFG\LLFINE\ETIVDGHISFTMFNCFP 


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GGVFHFPSFHCALPKGKVRKRIMA 

KCL/RRHLHTACGGP/LPT*ESPHSS

VKASTHPEPSARYS\SGTQS\R1RRM 

WNDTVRKQSESSFISGDINSTSTLNQ 

GHSLNNAXRDTSAMDTLPLNGNFN 

NSYSLHKGDYNDSVQWDCGLSLN 

DTAFEKXMIISELVHNNLRGSSKTHN 

LELTLPVKPVIGGSSSEDDAIVADAS 

SL\MHSDNPG/VWELHHK\ELEAPL\I 

PQRTHSLLYQPQKKVKSEGTDSYVS 

QLTAEAEDHLQSPNRDSLYTSMPNL 

R\DSPYPESSPDMEEDLSPSRRSENE 

DIYYKSMPNLGAGHQLQMCYQISR 

GNSDGYIIPINKEGCIPEGDVREGQM 

QLVTSL 

1588 

7085 

B 

1710 

98 

264 

XQVVCKKYRGFTIPEAFRGVHRYLS 
NAYAREEFASTCPDDEEIELAYEQV 
AKALK* 

1589 

7086 

A 

1711 

155 

1217 

DPPSPVPAPPSSPRDGHFLVPDATM 

AEEQPQV\ELFVKAGSDGAKIGNCP 

FSQRLFMVLWLKGVTFNVTTVDTK 

RRTETVQKLCPGGQLPFLLYGTEVH 

P\DTTKIEEFLEAVL\CPPR\YPK\LAA 

LNPEVQHSWGWD1FAKFFLPNIQEF 

QTPALN*QSGRRGFLESP*KVLDNY 

LT\SPPSPEEVDETSC*KIEGVSQ\RK 

RLDGQRRPHPWLDLQTCCPKVTH* 

VQW\CKRK*PGNSPHPPKAFPGKC 

HRVP*SKMPYAPGKNSPSHPVPDDE

EIELRPMSKVAKALQISPSLGLPSTP 

SIFSTKAPGGFHIATPMGHTPKLASG 

QGILGDIEPAKGVVEEGMRERNGGP 

GSDF 

1590 

7087 

A 

1712 

39 

256 

LSVKMEEGILPCSLYETTITDSKT*Q 
G*YI/EDFRLVFLINLNAYILKKMLV 
NHLR*NMRDNSETYRRIVRIV 

1591 

7088 

A 

1713 

1193 

1436 

PQSDFLDTLPQTSPIIP/1*EVPTGLVC 
YSSRVNKRAAPPASIPVPACSPPSPV 
SNPPHPVSNPPHVSAPLPCSSHQTQ 
QAP 

1592 

7089 

A 

1715 

2 

533 

ARDSFLAAMASHRVLLLLCLAGLWF 

VSEAGPTGTG\ESKCPLMVKV\LD\A 

VRGSPAHQMWP*HVFRKPR**PPW 

EAILPSGTRKTQLSLGEIAHGAHKLR 

EGICTNGIYKSGK*DTKFFTGKGTL 

GIFPHFPLRHCQEVGISTGQRTSGPR 

RLTPLAALLEPLTPISTTGCSFTNSQ 

GN 

1593 

7090 

A 

1716 

38 

661 

APSPRRPWVISQRRTKATITSLWGKN 

VNVE\DAGGET\LGRL\LVVYPMDP 

RGFFDSFGNLSSASAI\MGNP\KVKA 

HG\KKVLT\SLGDAIK\HL\DDLKG/T 

PFAQA*SELHC*QACNVGS*GTFKL 

PGEILLVT/LFWAIPFSGKEFHPLRCQ 

VFLGQEQKMAEDGD\WS\GQCPCSF 

QITTELTGP*MQSFSRIWLYSCKQLQ 

IINLFLLRDHQ 

1594 

7091 

A 

1717 

32 

487 

SRRHGSSLWGKVNVEDAGGETLGR 
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